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Abstract. We prove a quantitative bi-Lipschitz nonembedding theorem for the Heisenberg 
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integrality gap of the Goemans-Linial semidefinite relaxation of the Sparsest Cut problem. 



Contents 

1. Introduction and statement of main results 

2. The proof of Theorem II .!( overview and background 

3. Preliminaries [Til 

4. Reduction to the stability of individual monotone sets [l8| 

5. Cuts with small perimeter [13 

6. The kinematic formula and J-monotone sets [24 

7. The quantitative interior and boundary [26| 

8. Classification of monotone sets [13 

9. Proof of the stability of monotone sets [36| 

10. Nondegeneracy of the initial configuration. 



11. Proof of Lemma PI 47 



12. Proof of Proposition 17.101 |5S 



13. Appendix 1: The Sparsest Cut problem [61 



14. Appendix 2: Quantitative bounds, coercivity and monotonicity [64 



References K3f 



J.C. was supported in part by NSF grant DMS-0704404. 
B.K. was supported in part by NSF grant DMS-0805939. 

A. N. was supported in part by NSF grants CCF-0635078 and CCF-0832795, BSF grant 2006009, and 
the Packard Foundation. 

1 



1. Introduction and statement of main results 



Theorem 11.11 the main result of this paper, is a quantitative bi-Lipschitz nonembedding 
theorem, in which the domain is a metric ball in the Heisenberg group, H, with its Carnot- 
Cartheodory metric, d m , and the target is the space L\, for the definition of d m see the 
subsection of Section [2] entitled: "The Heisenberg group as a PI space" . This result has 
consequences of a purely mathematical nature, as well as for theoretical computer science. 

Define c p (X, d x ), the L p distortion of the metric space (X, d ), to be the infimum of those 
D > for which there exists a mapping / : X — > L p satisfying —^M^ 2 £ [1 3 D] for all 
distinct x,y E X. The quantitative study of bi-Lipschitz embeddings of finite metric spaces 
in L p spaces goes back to (50J [28] • The modern period begins with a result of Bourgain, [12J, 
who answered a question of [3H] by showing that for every fixed p, any n-point metric space 
can be embedded in L p with distortion < lognj^ By [53], Bourgain's theorem is sharp for 
any fixed p < oo. 

Since Li, equipped with the square root of its usual distance, is well known to be isometric 
to a subset of L 2 (see for example [63]) it follows that if (X, d x ) isometrically embeds in 
Li, then (X, \fd^) isometrically embeds in L 2 . Metrics for which (X^y/d^) isometrically 
embeds in L 2 are said to be of negative type. Such metrics will play a fundamental role in 
our discussion. On the other hand, it is also well known that L 2 embeds isometrically in L\ 
(see for example [61]) which implies that c\(X, d x ) < c 2 (X, d x ), for all (X, d x ). Recently, it 
was shown that for n-point metric spaces of negative type, c 2 (X, d x ) < {\ogn) 1 ^ 2+ °^ l \ and 
in particular Ci(X, d x ) < (logn) 1 ^ 2+ °^; see [5J, which improves on a corresponding result in 
[15] . For embeddings in L 2 , the result of [5J is sharp up to the term o(l); see [28] . 

As shown [IS] , the Carnot-Caratheodory metric d m is bi-Lipschitz to a metric of negative 
type. From this and Corollary 11.51 of Theorem 11.11 below, it follows immediately that for 
all n, there exist n-point metric spaces of negative type with ci(X, d x ) > (\ogn) s , for some 
explicit 5 > 0. From the standpoint of such nonembedding theorems, the target L\ presents 
certain challenges. Lipschitz functions / : R — ► L\ need not be differentiable anywhere. 
Therefore, a tool which is useful for L p targets with 1 < p < oo is not available. Moreover, 
the fact that L 2 embeds isometrically in L\ implies that bi-Lipschitz embedding in L\ is no 
harder than in L 2 and might be strictly easier in cases of interest. 

The Sparsest Cut problem is a fundamental NP hard problem in theoretical computer 
science. This problem will formally stated in Section [12] (an appendix) where some addi- 
tional details of the discussion which follows will be given; see also our paper [21] (and the 
references therein) which focuses on the computer science aspect of our work. A landmark 
development took place in the 1990's, when it was realized that this optimization problem 
for a certain functional defined on all subsets of an n-vertex weighted graph, is equivalent to 
an optimization problem for a corresponding functional over all functions from an n-vertex 



1 In this paper, the symbols <, >, denote the corresponding inequalities, up to a universal multiplicative 
constant, which in all cases can be explicitly estimated from the corresponding proof. Similarly, x denotes 
equivalence up to such a factor. 
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weighted graph to the space L\\ see [101 EH [9]. This is a consequence of the cut cone rep- 
resentation for metrics induced by maps to L±, which also plays a fundamental role in this 
paper; see Section [TJ Once this reformulation has been observed, one can relax the problem 
to an optimization problem for the corresponding functional over functions with values in 
any n-point metric space. The relaxed problem turns out to be a linear program, and hence, 
is solvable in polynomial time. Define the integrality gap of this relaxation to be the supre- 
mum over all n point weighted graphs of the ratio of the solution of the original problem to 
the relaxed one. The integrality gap measures the performance of the relaxation in the worst 
case. It is essentially immediate that the integrality gap is < the supremum of ci(X, d x ) 
over all n point metric spaces, (X,d x ), and hence, by Bourgain's theorem is < logn; see 
[53| [9]. This upper bound relies only on the form of the functional and not on any special 
properties of L\. A duality argument based on the cut cone representation shows that the 
integrality gap is actually equal to the supremum of Ci(X, d x ) over all n point metric spaces. 

Subsequently, Goemans [31] and Linial [5T] observed that if in relaxing the Sparsest Cut 
problem as above, one restricts to n point metric spaces of negative type, one obtains a 
semidefinite programing problem, which, by the Ellipsoid Algorithm, can still be solved in 
polynomial time with arbitrarily good precision; see [3UJ. As above, by the duality argu- 
ment, the integrality gap for the Goemans-Linial semidefinite relaxation is actually equal 
to the supremum of C\{X, d x ) over all n point metric spaces of negative type. Based on 
certain known embedding results for particular metric spaces of negative type, the hope was 
that this integrality gap might actually be bounded (the "Goemans-Linial conjecture") or 
any case, bounded a very slowly growing function of n. At present, one knows the upper 
bound < (logn) 1/ ' 2+0 ^ 1 ' ) , which follows from from [5]. This result makes the Goemans-Linial 
semidefinite relaxation the most successful algorithm to date for solving the Sparsest Cut 
problem to within a definite factor. In the opposite direction, it was shown in [13] that the 
integrality gap for the Goemans-Linial semidefinite relaxation is > log log n. The analysis 
of [13] , improved upon that of the breakthrough result of [H] , which was the first to show 
that the Goemans-Linial semidefinite relaxation cannot yield a constant factor approxima- 
tion algorithm for the Sparsest Cut problem, thus resolving the Goemans-Linial conjecture 
[2B EH E2] ■ These lower bounds on the integrality gap depend on its characterization as the 
supremum of Ci(X, d x ) over all n point metric spaces of negative type. 

Motivated by the potential relevance to the Sparsest Cut problem, the question of whether 
(H, d m ) bi-Lipschitz embeds in L\ was raised in [18]. In response, it was shown in [18] that 
if U C HI is open and / : U — > L\ is Lipschitz (or more generally of bounded variation), 
then for almost all x £ U (with respect to Haar measure) and y varying in the coset of the 

center of HI containing x, one has lim^a, ^^-prz^r^ = 0. Thus, (H, d m ) does not admit a 
bi-Lipschitz embedding into L%. 

For further applications as discussed above, a quantitative version of this theorem of [18] 
was required; see Theorem 11.11 belowj^ It follows from Corollary 11.51 of Theorem 11.11 that 
there exists a sequence of n-point metric spaces, (X n ,d Xn ), of negative type, such that 



For purposes of exposition, in Theorem 1 1.11 we restrict attention to the case of Lipschitz maps, although 
everything we say has an analogous statement which apply to BV maps as well, sometimes with minor 
variations. 
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c\(X n , d Xn ) > (\ogn) s and hence, that the integrality gap for the Goemans Linial relaxation 
of Sparsest Cut is > (logn) 5 for some explicit 5 > 0; compare Remark 1 1.71 This represents 
an exponential improvement on the above mentioned lower bound > log log n; compare also 
the upper bound < (logn) 1 / 24 ^ 1 ). 

We also give a purely mathematical application of Theorem 11.11 to the behavior of the 
L\ compression rate of the the discrete Heisenberg group. The L\ compression rate is a 
well studied invariant of the asymptotic geometry of a finitely generated group, which was 
defined by Gromov in [32J; see below for the definition. 

In what follows, given p G H and r > 0, we denote the oF-open ball of radius r centered 
at p by B r (p) = {i6i: d m (x,p) < r}. 

Theorem 1.1 (Quantitative central collapse). There exists a universal constant 5 G (0, 1) 
such that for every pel, every f : B\(p) — ► L\ with Lip(/) < 1, and every e G (0, j), 
there exists r > e such that with respect to Haar measure, for at least half% of the points 
x G Bi{p), at least half of the points (xi,x 2 ) G B r (x) x B r (x) which lie on the same coset of 
the center satisfy: 

||/(Zl)-/(z2)|k 1 

dF{x u x 2 ) ~ (log(l/e))*' 

In particular, compression by a factor of r\ G (0, |) is guaranteed to occur for a pair of 
points whose distance is > e v c , where c = <5 _1 (5 as in ( 11.2D ). 

The constant 5 in Theorem II .11 can be explicitly estimated from our proof; a crude estimate 
is 5 = 2 -60 in Theorem 11.11 At various points in the proof, we have sacrificed sharpness in 
order to simplify the exposition; this is most prominent in Proposition ^. lOl below. which must 
be iterated several times, thus magnifying the non-sharpness. Obtaining the best possible S 
in Theorem 11.11 remains an interesting open question, the solution of which probably requires 
additional ideas beyond those contained in this paper. 

Before discussing the consequences of Theorem 11.11 we briefly indicate the reason for the 
form of the estimate (|1.2|) : see also the discussion of Section [2j We will associate to / a 
nonnongative quantity, the total nonmonotonicty, which can be written as a sum over scales, 
and on each scale as an integral over locations. The assumption Lip / < 1 turns out to 
imply an a priori bound on the total nonmonotonicity. Since for e G (0, 1), the total number 
of scales between 1 and e is x log(l/e), by the pigeonhole principle, there exists a scale as 
in (11.21) . such that at most locations, the total nonmonoticity is < l/(log(l/e)). We show 
using a stability theorem, Theorem 14. 131 that for suitable 5, this gives fll.2p . This discussion 
fits very well with a quantitative result (and phenomenon) due in a rather different context 
to Jones; see [10] and compare the discussion in Section [2j As explained in Section [TH the 
argument indicated above can be viewed as a particular instance of one which is applicable 
in considerable generality. 



In Theorem 11.11 by suitably changing the constant 5, "at least half" can be replaced by any definite 
fraction. 
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(1.2) 



The discrete case. As we have indicated, Theorem 11.11 has implications in the context 
of finite metric spaces. These are based on properties of the discrete version of Heisenberg 
group, H(Z) (defined below) and its metric balls, which follow from Theorem II. 11 



We emphasize at the outset that our results in the discrete case are obtained (without 
difficulty) directly from the corresponding statements in the continuous case, and not by a 
"discretization" of their proofs. As in [18] and in [23] , where a different proof of the nonquan- 
titative result of [18] is given, the proof of Theorem 11.11 is carried out in the continuous case 
because in that case, the methods of real analysis are available. Nonetheless, in the present 
instance, the quantitative issues remain highly nontrivial and the proof requires new ideas 
beyond those of [18], [23]; see the discussion at the beginning of Section [2] and in particular 
Remark EH 

We will view EI as M 3 equipped with the noncommutative product (a, b, c) • (a', b', d) = 
(a + a',b + b',c + d + ah 1 — ba!)\ for further discussion of H, see Section [2J From the 
multiplication formula, it follows directly that for R > 0, the map, Ar : R 3 — ■> R 3 , defined by 
An((a, b, c)) = (Ra, Rb, R 2 c) is an automorphism of EL It is also a homothety of the metric 
d m . The discrete Heisenberg group, H(Z), is the integer lattice, Z 3 = {(a,b,c) \ a, b, c £ Z}, 
equipped with the above product. It is a discrete cocompact subgroup of EL For further 
discussion of the Heisenberg group see Section [2j 

Fix a finite set of generators T of a finitely generated group, T. The word metric, d? on T 
is the left invariant metric defined by stipulating that dr(gi, #2) is the length of the shortest 
word in the elements of T and their inverses which expresses g± g\. Up to bi-Lipschitz 
equivalence, the metric d^ is independent of the choice of generating set. For the case of 
H(Z), we can take T = {(±1, 0, 0), (0, ±1, 0), (0, 0, ±1)}. For definiteness, from now on this 
choice will be understood. By an easy general lemma, given a free co-compact action of 
a finitely generated group T acting freely and cocompactly on a length space (X, d x ), the 
metric on T induced by the restriction to any orbit of the metric d x is bi-Lipschitz equivalent 
to d T ; see [H]. For the case of H(Z), we can take (X, d x ) = (H, d m ). 

Define p : M 3 x M 3 -> [0, 00) by 
p((x,y,z), (t,u,v)) 

(1.3) 
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((* - xf + (u- y) 2 ) +(v~ z + 2xu- 2yt) 2 + (t - x) 2 + (u-y) 

It was shown in [IS] that (EI, p) is a metric of negative type bi-Lipschitz to (EI, d m ) . 

It follows from [18] that 
(1.4) lim ci ({0, . . . , n} 3 , p) = 00 , 

n— >oo 

but no information can be deduced on the rate of blow up as n —>■ 00. From Theorem 11.11 
we get the following corollary, whose proof will be explained at the end of this subsection. 

Corollary 1.5. For the constant 5 > in Theorem \l.l\ we have for alln £ N, metric spaces 
({0, . . . , n} 3 , p) of negative type, statisfying 

(1.6) Cl ({0,...,n} 3 ,p) > {\ognf. 
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Remark 1.7. Since the metric spaces c\ ({0, . . . , n} 3 ,p) are of negative type, relation (11.61) 
implies that the integrality gap of the Goemans Linial relaxation of Sparsest Cut is > (\ogn) s 
for 5 > as Theorem 11.11 

Remark 1.8. A metric space is said to be doubling if a metric ball B 2r (x), can be covered by 
at most N < oo metric balls of radius r, where N is independent of x, r. For n-point metric 
spaces, (X,d), which are doubling, the bound in Bourgain's theorem can be sharpened to 
Ci(X, d) < C2(X,d) < Vlog n, which follows from the results of [El [58]; see the explanation 
in [35]. The metric spaces, (H, d m ) and (H(Z), dy) are doubling. (To see this, use for example, 
the left invariance of d m and the homotheties Ar.) Before the bi-Lipschitz non-embeddability 
of HI into Li was established in [18] , there was no known example of a doubling metric space 
which does not admit a bi-Lipschitz embedding into L\. Corollary 11.51 shows that there is a 
sequence of n-point doubling metric spaces for which c\(X,d) > (logn) 5 . 

Remark 1.9. The behavior of the L 2 distortion for n-point doubling metric spaces is much 
easier to understand than the Li distortion. Namely, for fixed doubling constant, the above 
mentioned bound, c 2 (X, d) < y/\ogn cannot be improved; see [16], and for dependence on 
the doubling constant, [12]. 

Given a finitely generated group T and a 1-Lipschitz function / : (r,dr) — > L%, Gromov 
( [32] ) defined the compression rate Uf : [1, oo) — > [0, oo) by: 

u f {t) := inf {||/(x) - /(y)|| Ll | d T (x,y) > t} . 

Stated differently, uif is the largest non-decreasing function for which \\f(x) — f(y)\\Li — 
ujj {dr{x, y)) for all x, y G T. 

It follows from [18] that for any 1-Lipschitz map / : H(Z) — > L\ we have U)f(t) = o(t), 

but [18J does not give any information on the rate at which tends to zero. From 

Theorem 11.11 we can obtain the bound: 

Corollary 1.10. For every f : H(Z) — > L\ which is 1-Lipschitz with respect to the word 
metric dw, we have for all t > 1: 

where 5 > is the constant in Theorem \l.l[ 

Remark 1.12. By a general result from [62] (see Corollary 5 there) which implies that if an 
increasing function uo : [1, oo) — > [0, oo) satisfies ^-dt < oo then there exists a mapping 
/ : H(Z) — > Li which is Lipschitz in the metric dx and u)f > oo. In fact, / can be chosen 
to take values in the smaller space L 2 . By choosing oj{t) = v / 1+logf . 1 o glog ( 2 +t) > anc ^ bringing 
in Corollary 11.101 it follows that in the terminology of [7|, the discrete Heisenberg group 
(H(Z),d T ) has L\ compression gap 

( t t 

\yj\ + log t ■ log log(2 + t) ' (1 + log t) S 

It would be of interest to evaluate the supremum of those 5 > for which there exists 
/ : H(Z) -> Li which satisfies fTTTTTTl . L 2 ). 
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We close this subsection by explaining how Corollaries 11.101 and 11.51 are deduced from 
Theorem 11.11 A key point is to pass from the discrete settings of these corollaries to the 
continuous setting of Theorem 11.11 via a Lipschitz extension theorem. The basic idea is 
simple and general (see Remark 1.6 in [IB]). Additionally, the homotheties A R {a, b, c) = 
(Ra, Rb, R 2 c) are used to convert information from Theorem 11.11 concerning small scales, 
into information concerning large scales. The existence of these homotheties is, of course, a 
special property of (H, gF). 

We will give the details for Corollary II. 10( the case of Corollary 11.51 is entirely similar. 

Proof, (of Corollary ll.lOp Fix a map / : H(Z) — > L\, which is 1-Lipschitz with respect to 
the word metric d$- The map / can be extended to a map / : H — > L\ whose Lipschitz 
constant with respect to d m satisfies Lip / < 1. This fact follows from the general result 
of |47j . which states that such an extension is possible for any Banach space valued mapping 
from any doubling subset of a metric space to the entire metric space, but in the present 
simpler setting it also follows from a straightforward partition of unity argument. 

Fix R > 1 and define g R : ^((0, 0, 0)) -> L x by g R (x) = ±f(6 R (x)). Since d m (A R (x), A R (y)) 
= Rd n (x,y), Lip (g R ) < 1. For e £ (0, 4), an application of Theorem 11.11 shows that there 
exist x, y £ H such that R > d m (x, y) > eR, and 

(1.13) f(x)-f(y) T < -(Pfoy). 

la (log(l/e)) d 

Choose e = -^=. Since there exist a, b £ Z 3 such that d m (a,x) < 1 and d a (b,y) < 1, and 
since Lip / < 1, it follows from (11.131) that provided R is large enough, 

cj f {d w (a,b)) < \\f(a) - f(b)\\ Ll < — ? < - — s , 

(\ogR) d (log (d m (a,b))) d 

implying (11.111) . □ 



2. The proof of Theorem ll.lt overview and background 



We begin with an informal overview of the proof of Theorem 11.11 Then proceed to a more 
detailed discussion, including relevant background material. 

Many known bi-Lipschitz non-embedding results are based in essence on a differentiation 
argument. Roughly, one shows that at almost all points, in the infinitesimal limit, a Lipschitz 
map converges to a map with a special structure. One then shows (which typically is not 
difficult) that maps with this special structure cannot be bi-Lipschitz. The term "special 
structure" means different things in different settings. When the domain and range are 
Carnot groups as in Pansu's differentiation theorem [57], special structure means a group 
homomorphism. As observed in [IHl [19] , Pansu's theorem extends to the case in which the 
domain is (H, d m ) and the target is an infinite dimensional Banach space with the Radon- 
Nikodym property, in particular, the target can be L p for 1 < p < oo. 

The above approach fails for embeddings into L\, since even when the domain is R, Lips- 
chitz maps need not be differentiable anywhere. A simple example is provided by the map 
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t — > X[o,t], where X[o,t] denotes the characteristic function of [0,i]; see [3]. Nevertheless, the 
result on central collapse proved in [18] can be viewed as following from a differentiation the- 
orem, provided one interprets this statement via a novel notion of "infinitesimal regularity" 
of mappings introduced in [T8] . 

The approach of [18J starts out with the cut-cone representation of L\ metrics (see [271 [T8"] ). 
which asserts that for every / : EI — > L\ we can write 



for all x, y, where £/ is a canonically defined measure on 2 H (see the discussion following (12.71) 
for precise formulations). The differentiation result of [18] can be viewed as a description 
of the infinitesimal structure of the measure £/. It asserts that at most locations, in the 
infinitesimal limit, is supported on vertical half spaces. This is achieved by first showing 
that the Lipschitz condition on / implies that £ f is supported on special subsets, those with 
finite perimeter. One then uses results on the local structure of sets of finite perimeter in 
the Heisenberg group to complete the proof; the argument is described in greater detail later 
in this section. 

An alternative approach to this result on the infinitesimal behavior of £/, which does not 
require the introduction of sets of finite perimeter, was obtained in [23]. This approach is 
based on the classification of monotone sets. A subset E C EI is called monotone if for every 
horizontal line L, up to a set of measure zero, E H L is a sub-ray of the line. If we view 
EI = R 3 , the horizontal lines are a certain codimension 1 subset of all the lines; for details, 
see the subsection entitled "The Heisenberg group as a PI space" . The proof of [23] proceeds 
by showing that infinitesimally the measure S/ is supported on monotone subsets, and a 
non-trivial classification theorem which asserts that monotone subsets are half spaces. (The 
proof is recalled in Section 0) 

Our proof of Theorem 11.11 combines the methods of [18] and [23j , with several significant 
new ingredients. Inspired by the proof in [23], our argument is based on an appropriately 
defined notion of almost monotonicity of subsets. But, unlike [23J, we require the use of 
perimeter bounds as well. In our situation, perimeter bounds are used in finding a controlled 
scale such that at most locations, apart from a certain collection of cuts, the mass of £/ 
is supported on subsets which are sufficiently close to being monotone. In actuality, the 
excluded cuts may have infinite measure with respect to S/. Nonetheless, using perimeter 
bounds and the isoperimetric inequality in EI, we show that their contribution to the metric 
is negligibly small. 

A crucial, and by far the most complicated, new ingredient in this paper is a stability 
version of the classification of monotone sets of [23] , which asserts that sets which are almost 
monotone are quantitatively close to half spaces. One of the inherent reasons for the diffi- 
culty of proving such a stability result is that the need to work locally, i.e. to consider almost 
monotone subsets inside a metric ball in EI of finite radius. Here the situation is fundamen- 
tally different from the corresponding classification result in [23]: There are even precisely 
monotone subsets in such a ball which are not half spaces^; see Example 19 . 11 Inevitably, our 

In E™, monotone subsets of a ball are necessarily the intersection of the ball with a half space. 
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classification result must take this complication into account. We can only assert that on a 
(controllably) smaller ball, the given almost monotone subset is close to a half space. 



In order to make the above informal description precise, we will require some additional 
preliminaries, which are explained in the remainder of this introduction. For a discussion, 
in a wider context, of estimates whose form is equivalent to that of (11.21) . and which arise 
from our general mechanism, see Appendix 2, Section [TH 

Remark 2.1. As we explained above, the present paper belongs to the general topic of con- 
trolling the scale at which the infinitesimal "special structure" of a mapping (arising from 
a differentiation theorem) appears in approximate form. However, it is important to realize 
that it is impossible in general to obtain a quantitative estimate for the rate at which the 
limiting situation is approached. To see this point, consider a model problem associated to 
the sequence of functions f n (x) = S1 ^ x ) . Although \f' n (x)\ < 1, for all n 6 N and x G 1, it 
is not possible control independently of n the difference between f n and its first order Taylor 
series; note that which controls the remainder term, is not bounded independently of 
n. Nonetheless, there is an explicit uniform estimate for the scale above which any f n is 
approximated by some linear function, which might not be equal to its first order Taylor 
series. As far as we are aware, the first instance of a result of this type is in |40J; see also for 
example [UJ, and for additional information, the appendix by Semmes in [53] . 

PI spaces. A natural setting for a large part of our work is that of PI spaces: a certain 
class of metric measure spaces, of which the Heisenberg group is a member. Together with 
the key concept of "upper gradient" on which their definition is based, these spaces were 
introduced in [38] . 

Let (A, d x ) denote a metric space. If / : X — > R, then g : X — » [0, oo] is called an upper 
gradient of /, if for all rectifiable curves c : [0,£] — > X, parameterized by arc length, 



A complete metric measure space (X, d x , fi) is called a PI space if a doubling condition and 
Poincare inequality hold. Thus, for all R > there exist (3 = /3(R), r = r(R),A = A(R),p > 
1, such that for all x G X and r < R we have: 



(2.2) 







B r (x)xB r (x) 



\f(xi) - f(x 2 )\d(ii x ii)(x 1 ,x 2 ) < r ■ r ■ 



B Ar (x) 



> p dn 



p 



where we use the notation: 
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As shown in [36], f 1 2 . 3 p and (12. 4p imply a strengthening of (12 .4p . which we will need. 
Namely, for some x = x{Pi T ) > 1, t' = r'(/3, r), one has the Poincare-Sobolev inequality, 



(2.5) (7 |/(i 1 )-/(i 2 )p%x/ 1 )(x 1 ,i 2 )) XP <t't- (7 <f d/i 

\JB r {x)xB r (x) / \JB Ar (x) 

For the case of the Heisenberg group of dimension 2n + 1, relation (12. 5p holds with x = |^if j 
thus if 2n + 1 = 3, then x — f • 

Observe that (12.41) . for fixed p, implies (12.41) for any p' > p. In this paper, without further 
explicit mention, we will always assume that p — 1, which is necessary for the ensuing results 
on finite perimeter. Solely in order to have one fewer constant to list, we will also make the 
innocuous assumption that A = 1. For the cases of primary interest here, X = K" or X = H, 
relation (12.51) does hold with p = 1, A = 1. 



Maps to L\i cut metrics and cut measures. Let X denote a set and let E C X denote 
a subset. In place of "subset", we will also use the term "cut". If E C X, then associated to 
E is a so-called elementary cut metric, denoted cIe, on X, where by definition, (1e(xi, x 2 ) = 
if either x\, x 2 both lie in E, or if x\, x 2 both lie in E' (here, and in what follows, we denote 
E' = X\E). Otherwise, (1e(xi, 22) = 1- 

Remark 2.6. Clearly, d,E = For this reason, it is common to define the space of cuts of 
X as the quotient of the power set 2 X by the involution, E 1— > E' , although we don't do this 
here. 



By definition, a cut metric is an integral of elementary cut metrics with respect to some 
measure E on 2 X . Thus, 

(2.7) d-z(xi,x 2 ) = \ d E (x 1 ,x 2 )dH{E) . 

J2 X 

For f : U —* L\, the pull-back metric induced by / is defined as df{xi,x 2 ) = \\f{xi) — 
f(x 2 ) ||i- It follows from the "cut-cone characterization" of L\ metrics (see [2"7] [T%]) that 
there is a canonically defined measure Sj on 2 X (on an associated sigma algebra) such that 
( g77D holds with E = £/. 

Now let (X, /i) denote a cr-finite measure space. Let U C X denote a measurable subset 
and / : U — > Li(Y, v) be a map which satisfies: 

/ II/OOIM// < 00. 

Ju 

There is a variant of the description of df in terms of Sj in the L\ framework; see [TSj. In 
this context, 2 U is replaced by a measure theoretic version of the space of cuts. One should 
regard a cut as an equivalence class of measurable sets E C U of finite /i-measure, where 
two sets are considered equivalent if their symmetric difference has measure zero. For our 
purposes, there is no harm in blurring the distinction between measurable sets and their 
equivalence classes, which for purposes of exposition, is done below. 
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Let Cut(U) denote the space of cuts. As above, we have: 

(2.8) d f (x 1 ,x 2 )= / d E (x 1 ,x 2 )dE f (E) , 

JCnt(U) 

where E/ is a suitable Borel measure on 2 U . Here we view df as an element of L X ° C {U x U), 
whose restriction to any subset V C U, with finite measure, lies in L\(V x V). Given two such 
Li-metrics d±,d 2 G Li(V x V) there is a well-defined Li-distance, denoted ||<ii — d 2 \\ Ll {y xV )i 
given by: 

(2.9) ||g?i - ^Hl^vxV) = / \d 1 (x 1 ,x 2 ) - d 2 (x lj x 2 )\d(iJ, x fi)(x 1 ,x 2 ). 

JVxV 

It is shown in |18j . that if X is a PI space, [/ c Bi(p) C I is an open set, and / is 
1-Lipschitz, or more generally, 1-BV, then the cut measure £/ has finite total perimeter: 

(2.10) f Pex{E)(U)= I PER(E,U)dJ: f (E) <c(/3,r). 

JC\it(U) JCnt{U) 

Intuitively, the perimeter, PEK(E, U), is the codimension 1 measure of the (measure 
theoretic) boundary of E fl U . In actuality, V — > PER(E fl V) defines a Radon measure 
Per(E). The restriction of this measure to subsets of U is denoted Per(E, U). The mass 
of Per(_E, U) is denoted PER(_E, U) and is equal to Per (E)(U); for the definition, see (13. 5p . 
Moreover, there is a total perimeter measure, A/, which is a Radon measure on U, such that 

(2.11) A/= f Pei(E,U)dZ f (E), 

JCnt(U) 

(2.12) Mass(A/) = X f {U) = I PER(E, U) dE f (E) . 

JCut(U) 

In particular, if / has bounded variation (f G BV) then is supported on cuts with finite 
perimeter. For the precise definitions and relevant properties of cut measures, BV maps and 
perimeter measures, see [T8] . 

The Heisenberg group as a Lie group. Recall that the 3-dimensional Heisenberg group 
H, can be viewed as M 3 equipped with the group structure 

(2.13) (a, b, c) ■ (a', b' , c') = (a + a , b + b' , c + c' + ab' — ba') . 

Note that the inverse of (a, b, c) is (— a, — 6, — c). The center Center (H) consists of the 1- 
dimensional subgroup {0} x {0} x M. There is a natural projection 7r : EI — >• Hi/Center (H) = 
R and the cosets of the center are just vertical lines in M 3 , i.e., lines parallel to the z-axis. 

Since correction term, ab'—a'b, in (12.131) . which measures the failure of the multiplication to 
be commutative, can be viewed as a determinant, we get the following very useful geometric 
interpretation: 

(*) The correction term, ab' — a'b, is the signed area of the parallelagram spanned by the 
vectors ir((a,b,c)), ir((a',b',c')). 
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(**) Equivalently, if we regard R 3 as R 2 x R 7 then oh' — a'b is the standard symplectic form, 
u, on R 2 . 

Let K : R 2 — > R 2 be an invertible linear transformation and set 

(2.14) A K (a, b, c) := (K(a, b), (det K) ■ c) . 
It is easily checked that A K is an automorphism of H. 

Let H g denote affine plane passing through jel, which is the image under left multipli- 
cation by g G H, of the subspace R 2 x {0} C H. We call H g the horizontal 2-plane at g e H. 
Since the automorphism Ak preserves the subspace R 2 x {0} C H e , where H e denotes the 
tangent space at the identity e = (0,0,0), it follows that Ak maps horizontal subspaces 
to horizontal subspaces. The collection {H g } g€ u defines a left-invariant connection on the 
principle bundle, R — > H — > R 2 , which in coordinates, has the following explicit description. 

The plane i?(o,o,o) is given by (u, t>,0) where v take arbitrary real values. In general, 
H(a,b,c) = (o, b, c)(u', v', 0) = (a + -u', b + v',c — bu + a?/), 
so putting a + u' — u, b + v' — v, we get 

(2.15) #(a,b,c) = (u,v,c + av - bu) . 

The affine 2-planes in R 3 whose projections to R 2 are surjective are just those which admit a 
parameterization (u, v, c + av — bu). It follows that every such 2-plane arises as the horizontal 
2-plane associated to a unique point (a, b, c), and conversely, that every horizontal 2-plane 
associated to some point in R 3 projects surjectively onto R 2 . 

A line in R 3 , which passes through the point (a, b, c), and lies in the plane i?( a ,b, c )) can De 
written 

(2.16) L = (a, b, c) + t(u, v, —bu + av) tel, 

where -u, f are fixed and t varies. In discussing the Heisenberg case, unless otherwise indi- 
cated, the term line will refer exclusively to such a horizontal line. The collection of all such 
lines is denoted lines (H). 

Definition 2.17. A half-space V C EI = R 3 is the set of points lying on one side of some 
2-plane P, including those points of the plane itself. 

The half-space 7i is called horizontal if its associated 2-plane H is horizontal. Otherwise 
it is called vertical. Thus, a vertical half-space is the inverse image under n of an ordinary 
half plane in R 2 . 

The Heisenberg group as a PI space. Consider the left-invariant Riemannian metric 
(•, •) on H which corresponds to the standard Euclidean metric at the tangent space to 
the identity. The Carnot-Caratheodory distance d m (xi,X2) is defined to be the infimum of 
lengths of curves c : [0,£] — > H, from x\ to x 2 , such that for all s, the tangent vector c'(s) is 
horizontal i.e. c'(s) lies in the 2-dimensional subspace of the tangent space corresponding to 
the affine plane if c ( s ). The length of c'(s) is calculated with respect to ( • , • ). In particular, 
one sees that for all gi, g 2 € H, 

(2.18) o M {g l ,g 2 ) = o M {g- l \g 2 1 ) . 
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A well known consequence (see [56]) of the definition of d m , in combination with (*), (**) 
above, is that if p, q G H, and c : [0, £] — > M? is a curve parametrized by arc length such 
that its horizontal lift c starts at p and ends at q (so that n(p) = vr(g)), then the vertical 
separation of p and q (in coordinates) is the signed area enclosed by the curve c. 

Geodesies of d H can be characterized as those smooth horizontal curves which project 
(locally) either to a circular arc or to a line segment in K 2 . Thus, the lines L G lines (HI) 
are precisely those geodesies of d m which are affine lines in HI = M 3 . Any two points of 
HI can be joined by a minimal geodesic of d m , although typically, not by a horizonal line 
L G lines (HI). Note that when viewed as a curve in (H, cP), any affine line which is not a 
member of lines (HI) has the property that any of its finite sub-segments has infinite length. 

The distance d m ((a, b, c), (a',b',cf)), is bounded above and below by a constant multiple 

of 

(2.19) ((a - a') 2 + (b - b') 2 )^ + \c-c' + ab' - ba'fi . 

When restricted to any vertical line is just the square root of the coordinate distance. Thus, 
there is a constant C > such that for the metric d m , the metric ball about e = (0, 0, 0) 
satisfies the box-ball principle: 

(2.20) B c -i r (e) C |(a,6,c)||a| < r, \b\ <r,\c\ <r^| C B Cr (e) . 

Thus, in coordinates, B r (e) looks roughly like a cylinder whose base has radius r and whose 
height is 2r 2 . 

In this paper we will consider the PI space (HI, gF,£ 3 ), where £3 denotes Haar measure 
on HI, which coincides with the Lebesgue measure on M. 3 . 

Let re denote rotation in M 2 through an angle 9 and let / denote the identity on IR 2 . From 
now on we write Og for A rg . Since Og preserves horizontal subspaces and induces an isometry 
on H e = M 2 x {0}, it follows that Og is an isometry of (HI, d m ). Clearly, Og preserves the 
measure £3 as well. Similarly, for ip G IR, the automorphism A^j scales the metric by a 
factor if) and the measure £3 by a factor ip 4 ; see (12.141) . 

Behavior under blow up of finite perimeter cuts of HI. Let £ C 5i(e) C H denote a 
finite perimeter (FP) cut. Then at Per(£')-a.e. p G E, asymptotically under blow up, the 
measure of the symmetric difference of E and some vertical half-space V, goes to zero; see [29], 
[5U] . Equivalently, the indicator xe converges to xv i n the L l ° c sense. The corresponding 
theorem for IR n is due to De Giorgi [22], [2S]- The results of [2S], [30] depend essentially 
on those of [I], [2], in which an asymptotic doubling property for the perimeter measure is 
proved for arbitrary PI spaces. 

If dy is an elementary cut metric associated to a vertical half-space then the restriction of 
d\> to a coset of the center is trivial. Thus, the results of [29], [30], together with (12. 7p . suggest 
that under blow up, at almost all points, a Lipschitz map / : HI — > L\ becomes degenerate in 
the direction of (cosets of) the center. In particular, there exists no bi-Lipschitz embedding 
of HI in L\. This is the heuristic argument behind the main result of |18j . 
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In order to prove Theorem II. 1[ we will take a different approach, leading to a quantitative 
version of a somewhat crude form of the blow up results of [29] , [30] (and the corresponding 
earlier results of [25], [26]). Here, "crude" means that our argument does not give uniqueness 
of the blow up, nor in the Heisenberg case, does it show that only vertical half-spaces arise. 
[3D] . For our purposes, neither of these properties is needed. Our approach is based on the 
notion of monotone sets as introduced [23] ; the discussion there, while not quantitative, does 
recover the results on verticality and uniqueness of blow ups. 

Monotone sets and half-spaces. In the definition that follows and elsewhere in this paper, 
E' denotes the complement of E. 

Definition 2.21. Fix an open set f/CH. Let lines([7) denote the space of unparametrized 
oriented horizontal lines whose intersection with U is nonempty. Let Mu denote the unique 
left invariant measure on lines (H), normalized so that Mu (lines (U)) = 1. A subset ECU 
is monotone with respect to U if for A/jj-almost every line L, both E D L and (U \ E) D L 
are essentially connected, in the sense that there exist connected subsets F L = F L (E),F' L = 
F'l{E) C L (i.e., each of Fl,F' l is either empty, equals L, or is an interval, or a ray in L) 
such that the symmetric differences (E fl L)AFl and ((U\E)n L)AF' L have 1-dimensional 
Hausdorff measure 0. 

Monotone subsets of EI were introduced in [23] where they were used to give a relatively 
short proof of the non-embedding theorem of [IB] , which does not require the introduction 
of FP sets and hence, does not depend on [29], [3D]. Instead, a blow up argument is used to 
directly reduce the nonembedding theorem to the special case in which the cut measure £/ 
is supported on monotone cuts; compare the discussion in the next subsection. For the case 
U = H, a nontrivial classification result asserts that if E is monotone, then C^(EAV) = 
for some half-space V; see [23] . 

Remark 2.22. If we regard EI as R 3 , then horizontal lines in EI are a particular codimension 
1 subset of the set of all affine lines in R 3 . A typical pair of points lies on no horizontal line. 
However, the classification of monotone subsets of H is precisely the same as for R 3 with its 
standard metric. In the latter case, the proof is trivial while in the former case it is not. 

Degeneracy of cut metrics which are supported on half-spaces. Once monotone 
subsets are known to be half-spaces it follows (after the fact) that the connectedness condition 
in Definition 12. 211 holds for almost every affine line L, i.e., not just for horizontal ones. Thus, 
if a cut measure £ is supported on monotone cuts, has the property that if x%, x 2 , x 3 G L 
and xi lies between x\ and X3, then ds(xi,X3) = ds(xi,X2) + ^2(^2, £3)- But if L is not 
horizontal, i.e., a coset of the center, then d m \i is comparable to the square root of the 
coordinate distance, and it is trivial to verify that this metric is not bi-Lipschitz to one with 
the property mentioned above; see (14.11) . 

In proving Theorem ll.il we will show that on a definite scale, using Theorem 14.131 we can 
reduce modulo a controlled error, to the case in which the cut measure is supported on cuts 
which are close to half-spaces. The error term, though controlled, is larger than the term 
which corresponds to the model case of monotone cuts. 
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5-monotone sets. Theorem I4.13[ which asserts that an approximately monotone set is 
close to some half-space (in the sense that the symmetric difference has small measure) 
plays a key role in the proof of Theorem 11.11 But unlike in [23], we cannot dispense with 
consideration of FP sets. We use (12.101) . the bound on the total perimeter, to obtain a bound 
on the total nonmonotonicity, and hence, to get the estimate for a scale on which the total 
nonmonontonicity is so small that Theorem 14.131 applies; see the discussion after Theorem 
ll.ll and compare Remark 12.231 below. 

Remark 2.23. If one restricts attention to 1-Lipschitz maps / : B — > L\, rather than more 
general BV maps, then using the fact that the 1-Lipschitz condition is preserved under 
restriction to horizontal lines, it is possible to derive the above mentioned bound on the 
total nonmonotonicity without reference to FP sets. However, even for Lipschitz maps, the 
introduction of FP sets cannot be avoided; see Lemma 14.21 which concerns an issue arising 
from the fact that the mass of the cut measure can be infinite; see also Proposition 15.21 

We will need to consider subsets, E C B R (p) C H, which are ^-monotone on B R (p); see 
Definition 14.111 Here, small 5 means approximately monotone, and 5 = corresponds to the 
case of monotone sets as in Definition 12.211 After rescaling, Theorem 14.131 states that there 
exists a constant a < oo, such that if E C B R (p) C HI is e a -monotone, and R > e -3 , then 
£ 3 ((E n Bi(p))A(V n5i(p)))< e, for some half-space V . 

Remark 2.24. Absent the assumption R > e~ 3 , the conclusion of Theorem 14. 131 can fail, even 
if "e a -monotone" is replaced by "monotone" ; see Example 19 .21 For the analogous result in 
MJ 1 , e~ 3 can indeed be replaced by 1. 

Remark 2.25. The discussion of monotone sets can be formulated for arbitrary PI spaces; see 
|22j . But in general, monotone subsets need not be rigid. In [21], the flexibility of monotone 
subsets of Laakso spaces [15] is used to construct bi-Lipschitz embeddings of Laakso spaces 
into L\. This is of interest since these spaces do not admit a bi-Lipschitz embedding into 
any Banach space with the Radon-Nikodym Property, e.g., separable dual spaces, such as 
L p , 1 < p < oo or £i; see [20] . 

The kinematic formula; perimeter and nonmonotonicity. To bound the total non- 
monotoncity in terms of the total perimeter, we use a kinematic formula for the perimeter of 
an FP set. From now on, we will just refer to the kinematic formula. In K n , the kinematic 
formula expresses the perimeter of an FP set E, as an integral with respect to the natural 
measure on the space of lines L, of the 1-dimensional perimeter function PEK(Er\L). In [55J, 
a suitable kinematic formula is been proved for Carnot groups; see fl6.ll) . In that context, a 
line means a horizontal line L 6 lines (H). 

Up to a set of measure zero, an FP subset of R is a finite union of disjoint closed intervals 
and the perimeter, PEK(E nl), is the number of end points of these intervals; see [3]. On 
the other hand, the condition that E is monotone can be reformulated as the requirement 
that for almost every line L, the 1-dimensional perimeter PER(£' n L) is either or 1. Since 
our initial quantitative data provides an integral bound on the mass of the total perimeter 
measure (see (12.101) . (13.141) ) it is not surprising that the kinematic formula plays a key role 
in our discussion. 
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3. Preliminaries 



In this section, X denotes a PI space. In particular, it could by W 1 or HI. Fix p £ X 
and let / : B r (p) — > L\ denote a Lipschitz map. In this paper we will often rescale this ball 
to unit size and correspondingly rescale the map /, and hence, the induced metric, c?s„, or 
equivalently, the cut measure. Finally, we rescale the measure, n, so that the rescaled ball 
has unit measure. 



We set 




(3.1) 


/ = r- 1 /, 


(3.2) 


E/ = r- 1 -E /j 


(3.3) 


d x = r~ 1 -d x , 


(3.4) 


M > p{B r {p)) 



In generalizing our considerations to BV functions, or in particular, sets of finite perimeter, 
it is of interest to consider as well, the effect of rescaling as in C I3 . 31) . (13 AL with ( 13.11) . ( 13.21) 
omitted. 

Note that Per (£)([/) is defined by: 
(3.5) Pei(E)(U) = inf liminf / Lip hi dp, 

{hi} » Ju 



L 



loc 



where the infimum is taken over all sequences of Lipschitz functions {hi}^, with hi — ► xe- 
Since rescaling as in (13.31) . has the effect, 

(3.6) Lip(/t;) -> r • Lip(/ii) , 
it follows that (JH3D, (E3D, imply 

(3.7) Pe rW _ r .- I _L_.p er (£). 

Remark 3.8. If ( 13. II) . ( 13. 2ft . are omitted, then ( 13.71) is a relevant rescaling. For the case of FP 
sets in particular, it is the relevant rescaling. It can be used to give a quantitative analog of 
Theorem II. II for a single FP set, E, in which the set of points at which on a controlled scale, 
E is not close to a half-space, has small codimension 1 Hausdorff content, as measured with 
respect to coverings by balls of small radius. 

If ( 13. ip . (13.21) . are not omitted then by (13. 2p . (13.71) . the corresponding rescaling factor for 
the total perimeter measure, A/, is r • ^ ^ ■ r _1 , i.e., 
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Normalization. After rescaling d x ,fi,f, as in (13.31) . (13. 2p . (13.41) . we will often denote the 
rescaling of the ball, B r (x) as B r (x). Thus, 

(3.10) B r (x)cX, 

(3.11) #(£r(aO) =1, 

(3.12) / : B r (x) — > L\ , 

(3.13) Lip(/)=1, 
or more generally, 

(3.14) f Pei{E) (B r {x)) dE f {E) < 1 . 

JCut(B r (x)) ^ ' 

Standard inequalities. We will make repeated use of the trivial inequality (often called 
Markov's inequality), which states that for a measure space (Y, v) and / G Ia(Y, ^), one has 

(3.15) v{{x\f{x)>t})<r l ^\f\dv. 

We also use the weak-type (1,1) inequality for the maximal function, which is a conse- 
quence of the doubling property of the measure; see e.g. Chapter 1 of [61J. We now recall 
this basic estimate. 

Let j3 denote the doubling constant of \l. Let ( denote a Radon measure. Fix an open 
subset U and let C denote the collection of closed balls B r (y) such that B 5r (y) C U. Given 
k,j G N, define B jik = B jjk {() C C, by 

(3.16) B j>h := {Wv) e C I C (bAv)) > kr-i ■ p (fl^)) } . 
and set 

(3.17) B j>k = |J 1^). 

B r (q)eBj, k 

By the standard covering argument (see [61], [37]), there is a sub-collection {B ri (qi)}iZi C fi^fc, 
such that 

oo 

(3.18) fl i)Jfc cU^(fc), 

i=l 

oo 

(3.19) X^'-M^te))^*" 1 '/^'^- 

i=i 
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4. Reduction to the stability of individual monotone sets 



In this section we reduce the proof of the main degeneration theorem, Theorem ll.lt to 
Theorem 14.131 a stability theorem for individual monotone sets, which states roughly, that 
a set which is almost monotone is almost a half space. The complete proof of Theorem 14.131 
will occupy Sections I7HTT1 

The next few paragraphs contain an overview of this section. For definiteness, we will 
restrict attention to the Heisenberg group. Everything we say applies, mutadis mutandis, to 
the simpler case of M n as well. 

We begin by considering an ideal case, the proof of which is given prior to the more 
general Proposition 14.141 Then we explain how to reduce to ideal case up to an error which 
is controlled. 

Let dp denote a cut metric for which the cut measure is supported on cuts V which are 
half-spaces. Let e > 0. We will see by an easy argument that for every affine line L which 
makes an angle > 9 > with the horizontal plane, for a definite fraction of the pairs of 
points Xi,x 2 G L, with dP(xi, x%) = e, we have 

(4.1) d P (x 1 ,x 2 ) <e e ■ d a (xi,x 2 ) , 

where the implies constant in ( 14.11) depends only on 9. 

In proving Theorem II .![ we first find a scale such that at a typical location, the support of 
the cut measure consists almost entirely of cuts which are almost half-spaces, i.e., cuts which 
differ from half-spaces by sets of small measure. As explained below, for this, it suffices 
to find a scale on which the total nonmonotonicity is small. By the pigeon hole principle, 
it is easy to find a scale on which the total perimeter is small. On such a scale we apply 
Proposition I4.25[ which asserts that total nonmonotonicity of a cut metric can be bounded 
in terms of the the total perimeter. Proposition 14.251 is proved in Section [6j 

On a scale on which the total nonmonotonicity is small, the effect of cuts which are not 
almost half-spaces can be absorbed into the error term, after which they can be ignored. 
However, to get to a situation in which our conclusion can be obtained by appying Theorem 
I4.13[ we must further reduce to one in which there is a suitable bound on the mass of the cut 
measure. Otherwise, the total effect of the small deviations of the remaining individual cuts 
from being half-spaces could carry us uncontrollably far from the ideal case considered in 
(14. ip . This point is addressed in Lemma [4.21 which states that the cuts can be decomposed 
into a subset which makes a contribution which can be absorbed into the error term for the 
cut metric, and one for which the mass has a definite bound. Lemma I4~2l is proved in Section 

El 

Controlling the cut measure. The following lemma will be applied to rescaled balls, on 
a scale on which the total nonmonoticity is sufficiently small. For / Lipschitz, relation (14.31) 
in the hypothesis of Lemma 14.21 holds for all such balls. If more generally / is BV, then for 
most such balls, it holds after suitably controlled rescaling. 

Given metrics, d, d', define \\d — d'W^i as in (12.91) . 
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Lemma 4.2. Let f : B\(p) — > Li satisfy 

(4.3) A/^fr))^!- 

Given rj > 0, the support of can 6e written as a disjoint union D x U D 2; swc/i i/iai i/ 
df = di + d 2 denotes the corresponding decomposition of the cut metric df, then 

(4.4) H f {D x ) < rj- 3 , 

(4-5) l|d/-di|k<»7- 
/// is 1-Lipschitz, then so are d x ,d2- 

Lemma I4T21 is a consequence of the more general Proposition 15.21 which is proved in Section 

El 

Stability of monotone sets. Let lines (H) denote the space of unparameterized oriented 
horizontal lines, and let lines (t/ - ) denote the collection of horizontal lines whose intersection 
with U is nonempty. Let M denote the unique left-invariant measure on lines (M), normalized 
so that 

(4.6) M{\mes{B x {e)) = 1 . 

Let 7i l L denote 1-dimensional Hausdorff measure on L 6 lines (H) with respect to the metric 
induced from d m . We note that if E C H is measurable, then L n E is measurable for A/"-a.e. 
L e lines(H). 

Given a ball B r (x) C H and L e lines (B r (x)), we define the nonconvexity of (E,L) on 
B r (x), denoted NC Br ( x )(E, L), by 



I C L fl B r (x) subinterval > , 



(4.7) KC Br[x) {E, L) := inf if \ X i - XEnLnB r ( x )\ dU\ 

L J LC\B r {x) 

where we allow in the infimum above the subinterval / to be empty. Similarly, we define the 
nonmonotonicity of (E,L) on B r (x) by 

(4.8) NM BrW (fi, L) := NC Br{x) (E, L) + NC Br{x) (E', L) . 

The total nonconvexity and total nonmonotonicity of E on B r (X) are defined to be the 
following (scale invariant) quantities: 

(4.9) NC Br(x) (E) :=-J NC Br(x) (E,L)dAf(L) , 

r J\ines(Br(x)) 

(4.10) NM Mx) {E) := 1 / NM Br(x) (£, L) rfA^(L) , 

r Jlines(B r (x)) 

Note that NM^^)^) = if i£ is monotone, or more generally if the the symmetric difference 
of E and some monotone subset has measure zero. 

Definition 4.11. A cut E C B r (p) C H is said to be 5-monotone on B r (x) if 
(4.12) NM Br(x) (E) < 5 . 

We have the following stability theorem. 
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Theorem 4.13. There exists a > (e.g., a = 2 52 works here), such that if the cut E C B\{x) 
is e a -monotone on B\(x), then there exists a half-space PCI such that 

C 3 ((EnB e s(x))AV) < 
C 3 (B e s(x)) 

Cut measures supported on cuts which are almost half-spaces. In this subsection, we 
begin by verifying (14.11) . which concerns the case of a cut measure which is supported on half- 
spaces. Then we consider in Proposit ion 14 . 14[ the more general case of a cut measure which 
is supported on cuts which are almost half-spaces, i.e., cuts that satisfy the the conclusion of 
Theorem 14.131 Of course, there is an error term in the general case, which leads to a weaker 
estimate than in the ideal case. 



Proof of ( [^.Ip . Consider first an elementary cut metric on the real line, associated to a 
subset E such that E and E' are connected. Clearly, if x 3 lies between X\ and x 2 , then 

d E (xi, x 3 ) + d E (x 3 , x 2 ) = d E (xx,x 2 ) ■ 

By linearity, this holds more generally for cut metrics on the line whose cut measures are 
supported on such cuts. 

Let E-p denote a cut measure on B\(p) C H such that every cut in the support of S-p is 
of the form V D B\(p), for some half-space V. Then for almost every affine (not necessarily 
horizontal) line L, the restriction g?p|l is a cut metric as in the previous paragraph. 

Consider a subinterval I C Lfl Bi(p) with end points y, z such that say d m (y, z) = \. 
Let e be as in (14.11) and let ip > be arbitrary. Let a index those intervals J a <Z I whose 
end points xi iQ ,X2, Q satisfy d w (xi ja , X2, a ) = e, and such that d-p(xi ta ,X2, a ) > ip ' d w (xi,X2)- 
Note that if L is as in (14. ip (i.e., it makes a definite angle 9 with the horizontal plane) then 
C(J a ) x e • d m (x ltCe , x 2>a ) = e 2 . 

By the standard covering argument, there is a disjoint sub-collection { J ai , . . . , J aN } of 
{Ja}-, such that [j a J a C (5J Ql ) U • • • U (5J Qjv ), and by the length space property of dp, we 
have 

N ■ ipe < d v (xi,i,x 2 ,i) H h d p (x hN , x 2 ,n) < d v (y, z) = - . 

Thus, N < §(V> ■ e)" 1 and 

£(l) J «) ^ ^- 1 '6- 1 .e 2 = ^- 1 .e. 
By taking ip = e we get (14. ip . □ 



To simplify the statement of Proposition 14. 14[ it is stated for unit balls. In the application 
(to Lipschitz maps) we will set e\ = e 40 and consider the rescaling as in (I3.1l) - (l3.4p of B e 3 (x) 
to unit size. 

Proposition 4.14. Fix ip > 0. Assume that h : B\(p) —>■ L\ is such that S/j satisfies the 
total perimeter bound ^W, and assume that S/j is supported on cuts E C Bi(p), such that 
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for some half-space Ve, 

(4.15) C 3 ((V E nB 1 (p))AE) <(^-e 4 ) 4 . 

Then for at least a fraction 1 — if) of the affine lines L which make an angle at most | with 
the vertical^ such that Lfl B\(x) ^ 0, and at least two thirds of the (x x ,x 2 ) G (in B\(x)) x 
(LnBi(x)), satisfying 

(4.16) \e<d m (x u x 2 ) < ~e, 
there holds 

(4.17) d h (x 1 ,x 2 ) <e-d M (x 1 ,x 2 ). 
If h is 1-Lipschitz and 

(4.18) C 3 ((P E nB 1 (p))AE)<e 48 , 
then for all such L and at least half the points satisfying |JT6p ; 

(4.19) d h (x u x 2 ) <e-d M (x 1 ,x 2 ). 

Proof. Assume first that (I4.15P holds. Let D x , D 2 and d% = d\ + d 2 be as in Lemma \A. 21 with 
7] = ip ■ e 4 . Hence, T, h {D x ) < {ip • e 4 ) -3 and \\d h - dx\\ Ll < ip ■ e 4 . 

Using (14.15jl . it is straightforward to construct a half space Ve, which varies measurably 
with respect to (alternatively one can use a measurable selection theorem as in jB]), such 
that C 3 ((V E H B 1 (p))AE) < (V • e 4 ) 4 . Put 

(4.20) d V) i(xi,x 2 ) = I dv B (xi,x 2 ) dY>i(E) . 

From E h (D x ) < (ip ■ e 4 )" 3 , C 3 ((V E n B x {p))&E) < (ip ■ e 4 ) 4 , we obtain 

(4.21) \\di-d VA \\ Ll <iP-e 4 . 
This, together with \\dh — ^i||li ^ i> 4 e 4 , implies 

(4.22) \\d h -d Pil \\ L i < V'-e 4 . 

Write the integral in the definition of the Lj-norm in (14.221) as iterated integral, integrating 
first over pairs of points lying on a given affine line and then over the space of affine lines. It 
follows from Markov's inequality (13.151) that for a fraction > 1 — if> of affine lines L as above, 

(4.23) \\d h -d V!l \\ Lli L X L,CxC) <e 4 - 

On such a line L, consider the set of pairs satisfying (14.161) . for which (14.11) holds for the 
metric dp i. By noting that the measure of the space of pairs of points on L satisfying (14.161) 
is > e 2 , and applying Markov's inequality once more, on L x L to (I4.23p . we find that for at 
least half of the pairs points at which (14. ip holds for dp t i, the error term arising from (14.231) 
is < e 2 . This gives ( 127X71) . 

To obtain (I4.19p . note that since the space of affine lines has dimension 4, by taking 
ip = e 8 = (e 2 ) 4 , it follows that the set of lines intersecting Bi(p), for which ( I4.19P holds is 



5 ^ can be replaced by any number < ^. 
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< e 2 -dense with respect d m in the space of all such lines. In case h is 1-Lipschitz, this gives 
fT4TT9|) for all L. □ 

Scale estimate; total perimeter and total nonmonotonicity. Next, we show how to 
estimate from below a scale on which, apart from a collection of cuts which contributes 
negligibly to the cut metric df, the hypothesis of Theorem 14.141 will be satisfied at most 
locations; see Proposition 14.281 It is from this estimate that the logarithmic behavior in 
(1 1.2ft of Theorem 11.11 arises. 



Recall that the cut measure S/ is supported on cuts E with finite perimeter. Fix 5 > 0. 
Using the structure of finite perimeter subsets of EI and the kinematic formula, in Section [6] 
we will decompose the total perimeter measure A/ as a sum, 



(4.24) X f = J2 

3=0 



in such a way that the measure Wj controls the total non-monotonicity on the scale 5 J in the 
following sense. 

Fix j > and as usual, let Bi sj (x) denote the standard rescaling of the ball Bi sj (x) C 
BAp) as in fl3~T0l) - fl3~T4l) . 

Proposition 4.25. If for all r > 0, 

(4.26) Xf(B r (x)) < Cs(B r (x)) , 

then 

wj [Bi#(x) 



(4-27) / \ NM^ ,)(£") dTjf(E) < — ^ {- + 5. 



Cut 



In order to apply Proposition I4.25[ we need to find j such that Wj(Bi(p)) < 5, and hence, 



wABi SJ (x) 

at most locations, x, the term — ) f- is <o. 



Proposition 4.28. There exists j < 5 1 for which 

(4.29) M&ss(wj)<5. 
For such j and at least half the points x in Bi(p), 

(4.30) I NM^ {x) {E)dX f {E)<5. 

JCutj Bi gj (x) I I 63 



Proof. We have 

00 

(4.31) 1 > Mass(A/) = ^ Mass(w i ) . 

j=0 
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Thus the number of terms in the sum above for which Mass (wj) > S is bounded by < 5~ x , 
and the claim follows. The conclusion follows from the weak-type (1,1) inequality for the 
maximal function applied to the measure Wj, i.e., ( 13.191) . together with Proposition 14.251 □ 



Proof of Theorem II. li Let a be as in Theorem 14.131 Fix e > and let 

(4.32) 5 = ce 24 ^ +2 

for an appropriately small enough constant c > 0. By Proposition 14.281 can choose j < S" 1 , 
such that (14.301) holds for at least half the points x in Bi(p). Below we restrict attention to 
such a point x and rescale Bi sj (x) to standard size, denoting this ball as usual by Bi S j(x). 



By ( I4.27p . (14.301) and Markov's inequality (I3.15p . we can write Cut yBi 5J (x)j as a disjoint 

union Cut (Bi S j(x)j = D 3 U D^, with corresponding metric decomposition df = d 3 + c? 4 , 
where 

(4.33) V f (D 3 ) < ce 24 ' 6 + 2 , 

(4.34) NM^ jjW (£) < e 24 « \/EeD 4 . 

By ( 14.331) . we have d 3 (xi,x 2 ) < e 24 ' 6+2 , for all Xi,x 2 . Therefore, if df is restricted to B e 24-e(x) 
and this ball is rescaled to unit size, we get 

(4.35) \d f -d 3 \ <e 2 , 

while by ( I4.34p and Theorem 14.131 d 3 is supported on cuts satisfy which satisfy the hy- 
posthesis of Proposition 14.141 From Proposition 14.141 together with (I4.35p . Theorem 11.11 
follows. □ 



Preview of the proof of Theorem 14.131 After two preliminary technical results on 5- 
monotone subsets of EI have been stated in Section [71 and the classification of monontone 
subsets of EI has been reviewed in Section [SJ Theorem 14.131 is proved in Section However, 
there is a technical step in the argument (the nondegeneracy of the initial configuration) 
the proof of which, for reasons of exposition, is deferred until Section [TIB The proofs of the 
technical results, Lemma [T781 and Proposition 17. 101 are deferred to Sections [UJ [121 The proof 
of Proposition 17.101 is (by far) the most involved part of this paper. 



5. Cuts with small perimeter 



In this section we prove Lemma 14.21 which shows that cuts with very small perimeter 
make a negiligible contribution to the cut metric df. It is most natural to argue here in the 
context of general PI spaces. Thus, let X denote a PI space, for example, MP or EI. Fix 
p G X and let / : Bi{p) — > L\ satisfy, Lip(/) < 1. 

Put 

D 1 = {E\Ver{E){B x {x))<6}, 
D 2 = {E\ Per(£7)(Bi(z)) > 9}, 
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and let df = d\ + d 2 denote the corresponding decomposition of the metric df. Let (3 denote 
the doubling constant of X, and r', x > 1, the constants in the Poincare-Sobolev inequality 
fl23D. 

Proposition 5.2. If f : B\{p) — > L\ satisfies 

(5.3) \ f {B x {x))<K, 
then for all 9 > 0, 

(5.4) V f (D 2 ) < KB' 1 , 

(5-5) ¥f ~ d 2 \\ Ll < ■ . 

Proof. From (15. 3p and Markov's inequality (I3.15p . we get (14. 4p . 
For neN put 

D 1>n = {E\9 ■ 2~ {n+1) < Per(S)(Si(x)) < 6 ■ 2" n } . 
Then D% = UneN-^ 1 ."- Q5.3j) and Markov's inequality once more, we have 

E/(Z?i )n ) < K2 n+1 9~ 1 . 
Moreover, if E e D hn then by (1231) . 

(5.6) J \XE(x 1 )- X E(x 2 )\di2xdn<r' -(9-2- n ) x . 

JB 1 (x)xB 1 (x) 

By summing over n we get (15. 5p . □ 

Note that when Lip(/) < 1, by virtue of (12TT0|) . holds with if = c{(3,t'). To get 

(I4.16p . (14.171) from (15. 4p . (15. 5p . we take 9 to be a suitable multiple of rf (noting that for 
X = M, we have /3 = 16, x = §)• 



6. The kinematic formula and 5-monotone sets 



In this section, using the kinematic formula, we decompose the total perimeter measure 
as a sum of measures A/ = Wx + w 2 ■ ■ ■ ■ Then we prove Proposition 14.251 which states that 
Wj controls the total nonmonotonicity on the scale 5K 

We rely on the simple structure of sets of finite perimeter in dimension 1 and on the 
kinematic formula, which expresses the perimeter of an FP subset E of M" or a Carnot 
group, as an integral over the space of lines L, of the perimeters of the 1-dimensional FP 
sets E n L. 
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FP sets in dimension 1. Let V denote an open subset of K and let F C V have finite 
perimeter, i.e., Per(F)(V) < oo. Then there exists a unique collection of finitely many 
disjoint intervals, h(F), . . . , In(F), which are relatively closed in V, such that the symmetric 
difference of F and F(F) = {jf =1 h(F) has measure zero. Moreover, the perimeter measure 
per(F), is a sum of delta functions concentrated at the endpoints of these intervals, and the 
perimeter Per(F)(V) is equal to the number of endpoints; see Proposition 3.52 of |3j. In 
what follows, we will often assume without explicit mention that the set FcK, has been 
replaced by its precise representative F(F) = {jf =1 F(F). Note that X(F' fl V) = F(F)', 
per(F') = per(F). 

The kinematic formula. A kinematic formula exists for the PI space (H, d x ,C), and more 
generally, for any Carnot group, an in particular also for M n ; see Proposition 3.13, of [55J. 
Below, the notation is as introduced prior to (14.61) . 

Let U C H denote an open subset such that Per(E)(U) < oo. The kinematic formula 
states that the function L i— > Per(E fl L)(U fl L) lies in L 1 (lines(f/), A/"), and in addition (for 
some constant c = c(H) > 0) 

(6.1) Per{E){U) =c- [ Pei(E HL)(U fl L) dJ\f(L) . 

Perimeter bounds nonmonotonicity. Fix < 5 < 1. For all E, L as above, and j > 0, 
let Cj(E,L) denote the collection of intervals, I(E,L), occuring in X{E fl L fl E>i(x)) such 
that 

(6.2) 5 J+1 < length(/(£, L)) < 6 j . 

Let Sj(E,L) denote the collection of all end points of intervals in Cj(E,L). Let card(S*) 
denote the cardinality of the set S. For c(H) as in (16.11) and A C -Bi(x), put 

(6.3) Wj(E, L){A) = c(H) ■ card({e G £i(E, L) \ e e A}) , 

(6.4) W j{E){A)= [ Wj {E,L){A)dM{L), 

Jlincs(B 1 {x)) 

(6.5) w 3 {E){A) = G s (Ef){A) = \ ( Wj (E)(A) + Wj (E')(A)) . 
Finally, set 

(6.6) wj(A)= I Wj{E){A) d£ f {E) . 

JCnt(B 1 (x)) 

By the the kinematic formula (16.11) . we get the decomposition of of the perimeter measure 
and total perimeter measure, 

(6.7) Per(E) =Y t w j (E) , 

j 

25 



(6.8) A/ = J>i- 

3=0 

which is a restatement of (I4.24p . 



Proof of Proposition 4-25. Fix E with finite perimeter. For Af-a.e. L G lines (^Bi SJ (x)^. 



we 



can assume that L l~l E l~l -Bi^ (x) and L n -E' fl Bi^ (x) consist of finitely many intervals, 
in the natural consecutive ordering. Note that each the intervals h, ■ ■ ■ , In-i, 
(including both end points), are contained in Bi sj (x). In particular, these intervals lie in 
Ck(E, L), for various k > j. 

Since the nonmonotonicity NM^ m(E,L) is defined as an infimum over intervals (see 

(14. 8p ). it can be bounded from above by employing any specific interval I. For definiteness, 
assume say I x C E. where I N = if N = 1. If I/v C E, put I = L n Bi sj (x). If Jjy C E', 
put J = J( n Bigi(x). 

Rescale the ball Bi sj (x) to unit size as in (I3.10p - (l3.14p . Then by employing the above 
chosen interval /, we get 

(6.9) NM B W (J3,L)<£ Yl len S th (/)- 

k>j I&C k (E,L) 

Since an interval is determined by its endpoints, (16.91) implies 

(6.10) NM B [x) (E,L) < ^^Mass(^(E,L)) , 

and by integrating over the space of lines, we get 

(6.11) NM Bl . {x) {E) < Mass(^) + S Mass(Per(£)) , 

which completes the proof of Proposition 14.251 □ 

7. The quantitative interior and boundary 

The proof (though not the statement) of Theorem 14.131 utilizes quantitative notions of 
the interior and boundary of a measurable set E. In the present section, after defining 
these notions, we state two key properties for 5-convex and 5-monotone subsets of H; see 
Proposition I7.10[ Lemma 17.81 We also deduce two particular consequences of Proposition 
I7.10l for the structure of the quantitative boundary of 5- monotone sets. These are used in the 
proof of Theorem 14.131 Assuming Proposition 17.101 and Lemma I7.8[ the proof of Theorem 
14.131 is given in Section [9j The proofs of Proposition 17.101 and Lemma 17.81 which are the 
most technical parts of our discussion, are postponed until Sections [12] and [TTJ respectively. 

Definition 7.1. Let (X, d x , fx) be a metric measure space and E C X a measurable subset. 
For a G (0, 1) and u > define: 

fi(E n B u (x)) 



(7.2) mt a , u {E) := <^ x 



fi{B u {x)) 
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> 1 - a 



(7.3) d v (E) := i x 



fj,(B u (x)) 

Observe that d a>u (E) = for a > |. We note the following properties which are trivial 
consequences of the definitions. 



If P < a, then, 
(7.4) 

For U2 < u\ and x G X set 



int ftu (£) C mt a)U (E) 
d a , u {E) c d/3 >u (E). 



CiUx, U2, X 



Then 
(7.5) 



n(B U2 (x)) ' 

x G intQ, jU:L (i?) — x G int c ( ui)lt2 ,x)-a,u2 \E) 
x G d c ( UlU2X y aU2 (E) )' x G d a ui (-£/) • 



If x G d a>u (E) then 

(7.6) x G inti_ a , u (£) n int 
Finally, 

(7.7) (9 QiU (E))' = mt a , u (E) U int^^E'). 

Quantitative interior of 5-monotone subsets. If a subset E of a ball, Bi(p), is close 
to a half-space, then it follows that there exists a sub-ball of a definite size which is almost 
entirely contained in either E or E' . Conversely, proving that 5- monotone subsets have this 
property constitutes an important step in the proof that such a set is close to a half-space. 
(It is the remainder of argument, based on Proposition 17. 101 below, which requires that we 
shrink the size of the ball on which the conclusion is obtained. This is the content of Lemma 
17. 8[ which constitutes both Step A) of the proof in the precisely monotone case, treated in 
Section [8] and more generally, Step A') of the case, 5 > 0, treated in Section [91 

Lemma 7.8. There exists < c < ~ such that if E C B r (p) is e 2 -monotone on B r (p) then 
there exists q G Bi r (p) such that q G" d e>cr {E), i.e., q G int eiCr (£ l ) U int ejCr (£"). 

Quantitative convexity of 5-convex sets. If E C W 1 is convex and L is a line passing 
through points p G E and q G int(.E), then the segment of L lying between p and q also 
consists of interior points of E. Below, for subsets of EI which are almost convex, we given 
a quantitative version of this statement. 

We say that E C B r (p) is 5-convex on B r (p) if for NC Br (~ ) \(E, L) as in (|4.7p . 
(7.9) NC Br(p) (E)= f NC(E,L)a!Af(L)<5. 

J lines(_B r (x)) 
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Proposition 7.10. There exists universal constants c G (0,1) and C G (l,oo) with the 
following properties. Fix k, r], (,r,/)6 (0, 1) such that 

(7.11) p < min |-/tr 2 , c 
Set 

(7.12) Sl = CKW 



r 

(7.13) 5 2 = CK b 7i 3 £ 2 p 3 r 6 . 

Let L G lines(H) be parameterized by arc length and assume that 

(7.14) L(0) G inti_„, p (£) , 

(7.15) L(l) G int 5l)Cr (£) , 

and .E C EI is 52-convex on B 2 c(L(0)) , then for all s G [k, 1], 

(7.16) L{s) G mt^ crs (E) . 

Remark 7.17. The import of the proposition is the following: Even if L(0) is only in the 
very weak quantitative interior of E (r] small) we can ensure that L(s) is in the very strong 
quantitative interior of E (i.e. £ small), provided that £(1) is in the sufficiently strong 
quantitative interior of E (5i sufficiently small) and E is sufficiently convex (8 2 sufficiently 
small). 

Remark 7.18. The application to the proof of Theorem 14.131 requires some properties of 
the quantitative boundary for 5-monotone sets which are derived in the next subsection 
using Proposition 17.101 The assumption, L(0) G mt\^ VtP E, is guaranteed by assuming 
L(0) G d VtP E; see (ESJ). 

Remark 7.19. Note that the constants, 1 > n, rj,£,p > can be chosen arbitrarily small, 
provided r is then chosen to satisfy (17.111) . The particular form of (17.111) reflects the mul- 
tiplicative structure of H; compare Remark 112.191 In view of (17. 5p . by taking £ a definite 
amount smaller if necessary, we can replace csr in (I7.16P by any smaller positive radius. 



Lines in quantitative boundaries of 5-monotone sets. Given that any proper mono- 
tone subset, E, is a half-space, and hence, that BE is a 2-plane, it follows that if for 
L G lines (M), 2/1,2/2 G L PI dE, 2/1 7^ 2/2, then L C dE. Moreover, dE is a union of such 
lines L G lines (H). Conversely, these statements form two key substeps in the proof that 
monotone subsets H are half-spaces; see substeps 1) and 2) of part B) of Section [SI 

Essentially, Corollaries 17.271 17.411 of Proposition 17.101 constitute the corresponding sub- 
steps in the proof that ^-monotone subsets are close to half-spaces. Specifically, if we quantify 
the hypotheses of the above two statements, then the conclusions hold in a weaker quantita- 
tive sense. Prior to tackling these corollaries, whose statements are somewhat complicated, 
the reader may wish to look at the proofs of above mentioned substeps given in Section [HI 
Here and in Section [SJ the skeleton of the argument is precisely the same, but in Section [HI 
the technical complications are absent. 
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Below, we have sacrificed some sharpness to avoid further complication in the relevant 
expressions. 



Corollary 7.20. Let C, c be the constants from Proposition ^ '. 1 (J\ For every k, ai, a 2 £ (0, 1), 

ui G (o, m 2 6 (0, CK 2 ) define 



(7.21) 7 = C-max<' J^H,^ 



K CK 



and 

C b a\a 2 u\u 2 K 



(7.22) (3 



c 3 7 5 



7.23 5= — . 

Fix L G lines (H) which is parameterized by arc length and E C B 2 c{L(0)) which is 5- 
monotone on B 2 c(L(0)). Assume that 

x 1 = L(0) G d aitU1 (E) , x 2 = L(k) G d a2tU2 (E) , 

so that in particular d m (x, y) = k. Then for all t G [k, 1] we have 

L(t) e d^(E) . 

Proof. Assume for the sake of contradiction that L(t) ^ dp a {E). Thus without loss of 
generality we can assume that, say, x\ G iiiti_ ai)Ul (.E') and Lit) G 'mtp^(E). In order to 
apply Proposition 17.101 rescale the metric d m — > jd m , so that the rescaled distance between 
x\ and L{t) is equal to 1. Let L'(s) = L(ts), so that L' is parameterized according to arc 
length in the rescaled metric. We will apply Proposition 17.101 with the following parameters: 

(7.24) ti = qlx, P=~r, r =777> s = t> ? = "2 



Note that, with this notation, the parameter (3, as defined in ( 17.221) . is at most 5%, as given 
in (17.121) (where we used the assumption t > «). Hence in the rescaled metric, and using the 
notation in (17.241) . we have x\ = L'(0) G inti_ J?iP (S), L(t) = G mts lt cr(E). Moreover, 

after rescaling, E is 5-monotone on B 2C /t(xi), and hence E is ^-monotone on B 2C {xi). 
Observe that with 5 given in (I7.23p . is at most 5 2 as defined in (17.131) (using the assumption 
t > k). We are therefore in position to apply Proposition I7.10[ provided that we check that 
with our definitions n,r),£,r,p G (0,1), s G [k, 1], and (17. lip is satisfied. These facts are 
ensured by the assumptions u\ < u i < Cn 2 , t G [k, 1], and (17.211) . So, the conclusion of 
Proposition 17.101 says that in the rescaled metric we have x 2 = L'(s) G mt^ crs (E). 

Rescaling back to the original metric d , we see that x 2 G mt^ first {E) = int^^(E). In 
other words: 
(7.25) 

C 3 (E' n < t ■ ^{B^{x 2 )) = £ ■ (-^-) ■ C 3 (B U2 (x 2 )) ^a 2 -C 3 {B U2 {x 2 )) . 



2!) 



Note that ^ > u 2 , and hence Be2±{x 2 ) D B U2 (x 2 ), as ensured by (I7.2ip (since t < 1). 
Moreover, the assumption x 2 G d a2>U2 (E) implies that 

(7.26) £ 3 {E' n 5 w (x 2 )) > £ 3 (£' n 5 U2 (x 2 )) > a 2 • £ 3 (5 U2 (x 2 )) . 

Inequalities (17.251) . (I7.26P yield the desired contradiction. □ 

Corollary 7.27. Let C,c be the constants from Proposition 7.10 For every k G (0, |], 

Qti, a 2 G (0, 1), ui G (o, gjr J , u 2 G (o, fjj de/me 



7* 



C 3 / 2 ( - 

K 



1/2 J^iV 74 

max < I — I 




ft 



128C 29 afof?>2 2 
K 5 7 i9 

2 16 C 41 a?ai^ 20 



(7.28) 

and 
(7.29) 

( 7 ' 3 °) - 81c 16 /€ 8 7 f 

Fix L G lines (H) parameterized by arc length and E C i?3c(L(0)), which is 5^-monotone on 
i?3c(L(0)). Assume that 

x x = L(0) G d aim (E) , x 2 = L(k) G d a2;U2 (E) , 

so that in particular d (x, y) = k. Then for all t G [0,1] we /mve 

(7.31) L(t)ed^(E). 



Proof. The case £ G [«, 1] is a simple consequence of Corollary 17.201 Indeed, since E is 5*- 
monotone on B^c(L(0)), it is also (|) • ^-monotone on B 2 c(L(0)). Note that (|) • 5* < S, 
where 5 is given in (IT.23j) (this follows immediately from the definitions and our assumed 
upper bounds on ui,u 2 ). We can therefore apply Corollary 17.201 to deduce that L(t) G 
dp a (E), where /3, 7 are given in (17.221) . (17.211) . Note that 7* > 7, and hence by (I7.5P we have 



(7.32) 
Since /3* < 



(3 we can use (I7.4p to deduce that L{t) G d^ iJit (E), as required. 



To deal with the case t G [0, k], note that since, as explained above, E is 5-monotone on 
B 2C (L(0)), by Corollary E20] we have L(2k) G d Pa {E). We wish to apply Corollary EH] 
to the points L{2k) 1 x 2 = L(K),L(t), and to the geodesic L'(s) = L(—s + 2k) (so that 
L'(0) = L(2k), L'(«) = L(k), L'(2n-t) = L(t), where 2k -t G [k,1]), with (ai,ui) replaced 
by (/3,j) and (a 2 ,w 2 ) unchanged. Note that it follows from (17.210 . (17.280 that: 



(7.33) 



7* 



cJ 2 ^ 



C • max < \ — , — 



K 




K CK 



where in the last step above we used the fact that 7 > which implies that y ^ > ^ by 
our assumption on u 2 . We can restate ( 17.331) as saying that 7* corresponds to the quantity 7 of 
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Corollary 17.201 where we substitute our new choice of parameters. Moreover, by substituting 
these values into (17.231) and (17.221) . we see that for these new parameters (|) • 5* is equal 
to the value of 5 from ( 17. 23ft . and /3* is equal to the value of 3 from ( 17.291) (we chose the 
values of /3*, 5* precisely for this purpose). Note that since E is ^-monotone on 5 3 c(L(0)) D 
B2c(L(2k)), it is also (|) • ^-monotone on B 2 c(L{2k)). In order to apply Corollary 17.201 

we also need to ensure that 7 = C ■ max { \/—, — \ < -rr and 3 = ° a i^ 2 ^i"2 K < 1 which 

is indeed the CRSG, clS follows from the assumed upper bounds on ui,U2- The conclusion of 
Corollary 17.201 is precisely ( 17.311) . □ 



Corollary 7.34. Let C, c be the constants from Proposition \7.1C\ Fix a,u E (0, 1), t G (0, |] 

and d > 0, such that: 

(7.35) — u<d<minl2C,— \ -t, 



c ,, sc 



4 



(7.36) cm 4 < min j ^d d t, -^dt 

Denote: 



(7.37) 



C 20 aV 6 
2V2 d 8 t 8 ' 



C 17 a 5 u 20 

(? - 38) C= 2^'^' 

Assume that x G H and £/ C B2c{x) are such that x G d a , u (E). Assume also that E is 
(-monotone on B 2 c(x). Then there exists L G lines(H) with L(0) = x such that either L(t) 
or L(—t) is in d^d(E). 



Proof. Take any L G lines(H) with L(0) = x. Note that (17T351) . (17361) . (17T371) imply that 
4> <\- Hence, we are done if one of {L(— t), L(t)} is in mt^^E) and the other is in int^^-E 1 '). 
Indeed, in this case since the collection of all oriented lines through x is connected (it can 
be identified with the unit circle) the required result follows from the Intermediate Value 
Theorem. 

Assume for the sake of contradiction that the assertion of Corollary 17.341 is false. Then 
by the above discussion {L(— t),L(t)} C int^(.E) or {L(— t),L(t)} C mi^^E 1 ). So, assume 
without loss of generality that {L(—t), L(t)} C mt^^E). In order to apply Proposition 17.101 
we rescale the metric: d m — > hd u . Denote y\ = L(—t), 1/2 = L(t), so that in the rescaled 
metric the distance between y\ and ?/2 is 1, and their distance from x is ~. The geodesic 
L'(s) = L(2ts — t) is parameterized by arc length in the rescaled metric with L'(0) = yi, 
V (|) = x, L'{1) = y 2 . We will apply Proposition 17.101 to the geodesic L' with the following 
parameters: 

2Cu\ A d _C 5 au 4 

~c~d) ' r ~2C~r P ~8c^'~d¥' 
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(7.39) 



K = 7] 



2' ? 



a: 



Note that (I7.35p . (I7.36P ensure that p < min {i/?r 2 ,c} and r, £ < 1. In the rescaled metric 
yi,2/2 ^ m t^ id /( 2t )(_E) = int0 )C - r (_E). Since p < A we may conclude from (I7.5P that 



In the rescaled metric is ^-monotone on B c /t(x). Since t < | we have B2ci.U1) C B c /t{x) 
C 



and therefore .E is T^r-monotone on B2c(yi)- Observe that with the parameters set as 



in (I7.39p . we have 5\ = <p and 62 = -p§y[, where 61,62 are as in (I7.12p . (17.131) . Therfore, we 
can apply Proposition 17.101 which implies that (in the rescaled metric) 

(7.40) x G mt£ iCrs (E) hit (2 cu^ _cd_(E) . 

But, in the rescaled metric we know that x G d a ^(E). Note that it follows from (I7.35P that 

f t <j§- t - Hence, by (ESJ) and flUD we deduce from fl7T40|) that x G mt a> ^(E) C (d a ,jt (£))', 
which yields the desired contradiction. □ 

Corollary 7.41. There exist universal constants c, C > tira^/i the following properties. Fix 
a G (0, 1) and u G (0, c). Lei a; G H an<i .E C B^x) be 6-monotone on Bjj{x), where 

(7.42) 5 = c-a 29 u 71 . 

Assume also that x G d ayU (E). Then there exists L G lines(H) ; parameterized by arc length, 
such that x = L(0) and for all s G [— |, |] we /lane: 

(7-43) L( S )g^. q19m46 ^ 1/4 (E). 

Proof. Below we will have C > 4C, where C is the constant from Proposition 17.101 We will 
first apply Corollary 17.341 with t = | and d x u, while noting that if d is a large enough 
multiple of u, and c is small enough (i.e., u is small enough), then the conditions (17.351) . (17.361) 
are satisfied. For <p as in (I7.37P we have x a 4 u s , and for £ as in (17.381) we have ( x a 5 u 15 . 
Note that since E is 6- monotone on i%(x) it is also (^-monotone on B 2 c{x), provided c is 
small enough. Hence, it follows from Corollary 17.341 that there exists L G lines(HI) with 
L(0) = x and L (-|) G d^E). 

Next, we will apply Corollary 17.271 with X\ — L (— §), x 2 = x, n = |, ai = 0, ui = d, 
0:2 = a, U2 = u. Provided c is small enough, Ui,u 2 satisfy the conditions of Corollary 17.271 
For 7* as in (17.281) we have 7* x -u 1 / 4 , for (3* as in (I7.29p we have /?* x a 19 u 181,/4 , and for 
5* as in (I7.30P we have 5* x a 29 u 141 / 2 . Moreover, since E is 5-monotone on Bjj{x) it is 
also ^-monotone on Bsc{xi), provided c is small enough. Corollary 17.271 now implies that 
L(s) G d^^^E) for all s G [— |, |], as required. □ 

8. Classification of monotone sets 

It was shown in [23] that a proper nonempty monotone set E C EI is, up to a set of measure 
zero, a half-space V. In this section we recall the proof of this result, which is an essential 
preliminary for proof of Theorem 14.131 the stability version for 5-monotone sets, given in 
Section O To give the key ideas, it suffices to consider the case of precisely monotone sets. 
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We call a set E precisely monotone if E D L and E' C\ L are connected for all L (rather than 
connected modulo subsets of measure zero). 

There are two main steps: 

A) E n B r (p) or i?' n B r (p) has nonempty interior for every r > and pGi. 

B) Either <9.E is contained in a 2-plane, in which case (by a trivial connectedness argument) 
the theorem holds, or BE has nonempty interior. 

Step A) follows from the special case of Lemma 17.81 in which e = 0; the general case of 
Lemma I7T51 is A'), the quantitative version of A), which is used in the proof of Theorem 14. 131 
Below, we prove Step B). 

Proof of Step B). It will suffice assume that dE is nonempty. 
Proof. The proof of B) has the following substeps. 

1) If L e lines(H) and L D dE contains at least two points, then L C dE. 

2) dE is a union of lines. 

3) Steps 1) and 2) imply B). 

Steps 1), 2), are special cases of Corollaries 17. 27} I7.41| respectively. But for clarity, we 
will repeat the proofs in the present simpler (nonquantitative) situation. The proof of step 
3) requires some additional properties of pairs of lines, which are established in the next 
subsection. 

Properties of pairs of lines. Li,L 2 G lines (H) are called parallel if L 2 = g ■ Li, or 
equivalently, L\ = g~ l ■ L2, for some g £ H. This holds if and only if the projections 
7r(Li), 7r(L 2 ) are either parallel or coincide. Unless ir{L\) = 7r(L 2 ), the lines Li,L 2 are not 
parallel as lines in R 3 . 

If Li, L 2 are not parallel, then their projections intersect in a unique point. If in addition, 
L\ fl L 2 = 0, then the pair Li, L 2 is called skew. We put 

(8.1) F(L 1 ,L 2 )=7r- 1 (7r(L 1 )n7r(L 2 )). 
Define the vertical distance between Li, L 2 by 

(8.2) d v (Lx, L 2 ) = rf H (L! n F(Lx, L 2 ), L 2 n F(L 1; L 2 )) . 

The following lemmas describe the family of lines L which intersect both Li and L 2 , where 
Li,L 2 , are either parallel lines with distinct projections, or skew. At this point, the roles 
of Li, L 2 are symmetric and assertions about Li should be understood as applying to L 2 as 
well. 
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Lemma 8.3. If L\,L 2 G lines (H) are parallel, and vr(Li) 7^ ir(L 2 ), then: 



1) For x G L\, there is a unique x* G L 2 , such that x,x* lie on a line L(x) G lines (H). 
Moreover, there is a unique shortest line segment b(Li, L 2 ) from L\ to L 2 which is orthogonal 
to both Li and L 2 . 

2) There exists a unique point m = m{Li,L 2 ) G M 2 lying halfway between vr(Li) and 7r(L 2 ), 
such that for all x G L\, the line L{x) intersects the fibre ■w~ 1 {m). Moreover, every point on 
7r _1 (m) intersects some line L(x). 

3) The union of the lines {L^)}^^ is a smooth ruled surface X = X[L\,L 2 ) defined over 
IR 2 \ T, where T = T(Li,L 2 ) denotes the line parallel to n(Li),it(L 2 ), which passes through 
m(Li, L 2 ). At no point x G X is X tangent to the horizontal subspace H x . 

Proof. Let Og be as in the subsection of Section [I] entitled "the Heisenberg group as a Lie 
group". By applying Og for suitable 9, and then applying a suitable left translation we 
can assume that L\ = (0,0,0) • (i, 0, 0). Next, after a suitable translation (a", 0,0) (which 
leaves Li invariant) we can assume that assume that L 2 intersects the plane (it, v, 0) along 
the w-axis, and hence can be written as (0, b", 0) • (s, 0, 0) (we are using here the fact that 
7r(Li) 7^ tt(L 2 )). Finally, after a translation of the form (0, —b"/2, 0) we can assume that 

(8.4) L\ = (t, e, — et) L 2 = (s,—e,es) 
for some eGl. 

For this pair, if x = (t, e, — et), then 

(8.5) x* = {-t, -e, -et) . 

To see this, observe that the line L(x) in R 3 which joins these points, is parallel to the plane 
(u, v, 0) and passes through the point (0,0, —et). So L(x) G lines(H); see (I2.15p . The fiber 
7r _1 ((0, 0)) is the unique fiber through which all of these lines pass as t varies in (—00, 00), 
and every point on 7r _1 ((0, 0)) arises in this way. Thus, for this pair, 

(8.6) m(L 1 ,L 2 ) = (0,0). 

The unique shortest line segment joining L\, L 2 has length 2e and is contained in the line 

(8.7) b(L u L 2 ) := (0, s, 0) . 
The surface, X, is given by 

(8.8) (u,v, -e 2 u/v) . 

From these facts, all remaining statements are straightforward to verify. □ 

Lemma 8.9. If L 1 ,L 2 G lines(H) are skew, then there is a hyperbola Y = Y(Lx,L 2 ) C M. 2 
with asymptotes 7r(Li), vt(L 2 ), such that every tangent line of Y has a unique lift, L G 
lines (H), which intersects both L\ and L 2 . Conversely, if L G lines (H) and L R Li 7^ 0, for 
i = 1,2, then ir(L) is tangent to Y . Except for the unique point L\ D F{L\, L 2 ) (respectively 
L 2 fl F[L\, L 2 ) ) every point x of L\ lies on a unique line, L(x), intersecting L 2 (respectively 
every point x of L 2 lies on a unique line, L(x), intersecting L ly ). 
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Proof. By applying a suitable left translation and then an isometry Og and after possibly 
interchanging the subscripts of L\,L2, we can assume the canonical normalization: 

(8.10) L 1 = (t,wt,c), L 2 = (s,-ws,-c), w,c> 0, 7r(Li) n vr(L 2 ) = (0,0). 

From the geometric interpretation of the correction term given after (I2.13p . it follows that 
the points p = (t, wt, c) G L\, and q = (s, —ws, — c) G L 2 lie on some L G lines (H), precisely 
when there is parallelogram in M 2 spanned (s, —ws), (t,wt), with oriented area 2c. Define 
0(^,^(0,70 by 

(8.11) tfm{^0{L 1 ,L 2 ))=w, 
or equivalently, 

(8.12) taji(-6(Li,L 2 ))-ts = w-ts = c. 

Then (1 + w 2 ) ■ (sin#(Li, L 2 )) ■ ts = 2c. The envelope of the resulting family of lines is the 
hyperbola, Y, with equation: 

(8.13) w 2 u 2 -v 2 = cw. 

□ 

Corollary 8.14. Let Li,L 2 be skew lines as in the proof of Lemma \8.9[ Put x = (t,wt,c), 
x = (— t, —wt, c) (where x, x G Li). Then as elements of 'lines (H), the lines L(x), L(x), are 
parallel, and 

m(L(x),L(x)) = (0,0) . 

There are precisely two shortest horizontal segments joining L\ and L 2 . In the notation 
of the proof of Lemma [8791 these are the segments which project to the segments that touch 
n(Li), tt(L 2 ) and are tangent to the hyperbola Y at its focal point ±^J~cjw. Thus these 
segments are contained in the lines: 



(8.15) 



b+(L 1 ,L 2 ) := ( x /—,s,J—-s 
w V w 



b-(L u L 2 ) := ( - A /-,s, -J— ■ s 
w V w 



Remark 8.16. A pair of parallel lines, L\,L 2 , with distinct projections, can be viewed as 
a limiting case of a pair of skew lines L 2 t in which lim^oo 9(Li tt , L 2 t ) = (where 
9(Li it , L 2 t ) is the angle between 7r(L 1)t ) and 7r(L 2j t)), and as t — >• oo, one of b±(Li >t , L 2tt ) 
moves off to infinity while the other converges to b(Li,L 2 ), of the pair of parallel lines 
L\,L 2 ; compare (18.71) . 

Proof of 1). Let p G BE. Then the exists pi G E, with pi — > p. Let q G 'mt(E), the interior 
of E, and assume that there is a line, L G lines (M), with L(0) = p to L{[) = q. If Lj denotes 
the line parallel to L passing through pi then Li(£) — > L(£) and hence Li(£) G int(E) for 
i sufficiently large. Since E is convex, by (the non-quantitative version of) Corollary 17.271 
we get Li(s) G int(E'), for < s < £ and i sufficiently large and by passing to the limit 
L(s) G int(-E), for < s < £. Since, E' is also convex, the corresponding statement holds for 
E' as well. This completes the proof of sub-step 1). 
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Proof of 2). Let p G dE, L G lines (H) with p = L(0) and fix q = L(£) ^ p. We need 
to show that q G dE. By 1), we can assume say L(£) G mt(E). Similarly, L(—£) G int(-E') 
or L{—£) G int(-E"). By the same argument as in in the proof of 1), if L{£) G mt(E) and 
L(—£) G mt(E), then p = L(0) G int(-E). Therefore, we may assume that L(—£) G int(£"). 
Since the collection of unit tangent vectors to the collection of horizontal geodesies at p is 
a circle, and in particular, connected, it follows by continuity that there exists a horizontal 
geodesic, L, with L(0) = p and L{£) G dE. The conclusion of 2) now follows from sub-step 
!)• 

Proof of 3). By 2), dE is a union of lines. If dE contains only parallel lines with the 
same projection, then it is contained in a vertical plane. Likewise, if dE contains only lines 
passing through some fixed point, then it is contained in the horizontal 2-plane through that 
point. Thus, we can assume that dE contains either a pair of parallel lines with distinct 
projections or a pair of skew lines. By Lemma 18.31 if dE contains a pair of parallel lines 
Li, L 2 with distinct projections, then distinct members of the family of lines {L(x)} x& Li are 
skew lines which intersect 7r _1 (m(Li, L 2 )). So we can assume that dE contains a pair of 
skew lines L\, L 2 . 

Claim: If L\, L 2 are pair of skew lines, with L\ U L 2 C dE, then 7r _1 (7r(Li) D ^{L^j) C dE. 

To see this, note that by Corollary 18. 14[ for every line L 3 intersecting L\ and L 2 , there is 
a parallel line, L 4 , intersecting Li and L 2 , such that m(L 3 ,L 4 ) = ir(Li) fl 7r(L 2 ). Since by 
step 1), L 3 U L 4 C dE, it follows from Lemma [8.31 that 7r _1 (m(L 3 , L 4 )) G (9.E. 

For K = y(Li,L2) as in Lemma 18.91 Let U denote the component of IR 2 \ Y, which 
intersects both branches of Y. Every point of V — U\ (^(Li) U7r(L2)) is the intersection of a 
pair of distinct tangent lines of Y . Each such pair of lines lifts to a pair of skew lines L5, L$ 
in dE. By the Claim, 7r _1 (7r(L 5 ) n vr(L 6 )) G dE, so k~ 1 (V) C dE. Thus dE has non-empty 
interior, as required. □ 



9. Proof of the stability of monotone sets 

The proof of Theorem 14.131 i.e., the stability of monotone sets, has two main steps, A'), 
B'), which are quantitative versions of steps A), B), in the classification of monotone subsets 
of EI given in Section [SI In fact, A') is just Lemma [7.81 which is proved in Section [TT1 In 
this section, assuming assuming some technical geometric preliminaries which are proved in 
Section [TD1 we carry out step B'), thereby completing the proof of Theorem 14.131 

The proof of B') has substeps l')-3'), which correspond to substeps l)-3) of B). Substeps 
1'), 2'), are just Corollaries 17. 27[ [7.411 specialized to our specific context. Substep 3') requires 
a substantial preliminary discussion; see Section [TOj and in particular, Lemma 110.31 and 
Corollaries 110. 2 71 110.511 The point of the discussion in Section [10] is to quantify the following 
statement which occurs at the end of the first paragraph of the proof of substep 3): "Thus, 
we can assume that dE contains either a pair of parallel lines with distinct projections or 
a pair of skew lines". Once this statement has been quantified, the proof of substep 3') is 
completed by repeating, mutadis mutandis, the proof of substep 3). 
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The preliminaries to substep 3'), i.e., Section [TU1 constitute the part of the argument which 
necessitates assuming that E is e a - monotone on B e s(x), rather than on B\{x). The following 
example shows that such an assumption cannot be entirely avoided even if e a -monotonicity 
is strengthened to monotonicity. 

Example 9.1. Consider the set 

(9.2) E = {{x,y,z)\z<xy, y > 0} . 

We claim that if B r (p) C {(x, y, z) \ y > 0}, then EnB r (p) is monotone. This is equivalent to 
the statement that if a line L £ lines (H) passes through two points in d(E) — {(x, y, z) \ z = 
xy, y > 0}, then L is contained in dE. Note that the horizontal line L = (a,t,at) passes 
through (a, b, ab). If b > 0, then L is contained in dE. Conversely, if (a, b, ab), (a', b', a'b') e 
dE, then b, b' > 0. If these points lie on a horizontal line, since i?( ,6,a&) = (x, V, ab — bx + ay), 
we get 

(9.3) a'b' = ab — ba + ab' ; 

see (I2.15p . The solutions of (19.31) are: a' = a , V arbitrary, which gives the above line L, or: 
a' arbitrary, b' = —b, which contradicts the assumption b' > 0. 



Proof of Theorem 4-13. Recall that in Theorem I4.13[ we are given E C B\(x) which is 
e a -monotone on Bi(x). Our goal is to find a half-space ?CH such that 

C 3 ((EnB e s(x))AV) < 
C 3 (B e 4x)) ~ e ' 

For convenience of notation in the ensuing argument we will rescale the ball Bx(x) above by 
e~ 3 . Thus our assumption is that E C B e -z(x) is e a -monotone on B e s(x), and, for the sake 
of contradiction, that for no half-space V we have: 

C 3 ((EnB 1 (x))AV)<e. 

This contrapositive assumption will be used via the following simple lemma. In what follows, 
for A C EI and e > 0, the closed e-tubular neighborhood of A is denoted by: 



(9.4) T e (A) = {x G H : d H (x, A) < e} . 

Lemma 9.5. Let Q C EI be a 2-plane. Assume that for some e, r £ (0, |) a subset E C M 
satisfies: 

£ , r (£)n£i(x) CTJQ). 
Then there exists a half-space ?ci such that: 

C 3 ((EnB 1 (x))AV)<e. 



Proof. The set Bi(x) \ T e (Q) has at most two connected components, say Ci,C 2 (one of 
which might be empty). Since e < ~, by continuity, for j G {1, 2} we have Cj C mt e<r (E) or 
Cj C mt €ir (E'). For definiteness assume that G\ C int €tr (E). Let C C\ be an r-net in C\. 
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Thus the balls {B r / 2 {y)} y ^<yx are disjoint, and the balls {B r (y)} ye <yi cover C\. Moreover, since 
for each y G 91 we have y G mt £tr (E), we know that C 3 {B r {%i) R E') < eC%(B r {y)). Hence 



Cz{Ci nE')<J2 ^ 3 (B r (y) n £') < J> ■ C 3 (B r (y)) = 16e • ^£ 3 (S r/2 (y)) 
yeAT j/eAT yeoa 



16e-£ 3 (|J^/2(2/)) <16e-£ 3 (5! 



0) 



< 



This argument shows that for j G {1, 2} either C 3 (Cj fl < e or C,2,{Cj HE) < e. Thus we 
can take P to be either one of the half-spaces bounded by Q, or a half space that contains 
BAx). ' " ^ 



By virtue of Lemma 19.51 we can assume from now on that for every 2-plane P C EI we 
have: 



(9.6) d cie>ekl (E)nB 1 (x) <£T C1€ (P), 

where k± > 1, C\ > are constants which will be determined presently. 

By rescaling the metric d m by a suitable multiple of e~ 3 we can apply Corollary 17.411 with 
axe and u x e fcl+3 , and (after rescaling back to our present setting) deduce that provided 

a > 29 + 71(h + 3) = 242 + 71 A* , 

(9-7) k 2 < ^±5 , 

h 2 > 19 + 46(&i + 3) = 157 + 46fci , 

and e is smaller than a small enough universal constant, we can associate to every point 
y G d ee k 1 (E) fl B\(x) an open set O y of lines through the origin of H y , such that for all 
L G O y we have: 

(9.8) L ( [-c 2 e- 3 , c 2 e- 3 ] ) C d e u 2 ^ 2 (E) , 

for some small enough constant c 2 > 0. We have used here the fact that the definition of the 
quantitative boundary in (17. 3p is an open condition. 

As we shall see in Section [10] (see Lemma 110.11 and Corollaries 110.271 110. 44[ 110.511) the 
above discussion implies the following: If c\ in assumption (19.61) is a small enough universal 
constant, then there exists a pair of skew lines L, L' G lines(H), with the following properties: 

• The distance of L and V from x is at most f§e -3 - 

• Both L and L' intersect d e h 2 e k 2 (E) fl B C2€ -3/ 10 (x). 

• 8(L,L'), the angle between L and L', is bounded away from and tt, say, tan#(L,i7) G 
[§,§]; for the definition of 6(L, V) see fpTTTjl . 

• L, V have separation > e 3 , i.e. d v (L, L') > e 3 , where the vertical distance d v { ■ , • ) is as 
in©. 

We now use the lines L, V to perform substep 3'), that is, to produce a quantitative version 
of the argument in substep 3) of the classification of precisely monotone sets. 
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First of all, we will apply again Corollary 17.411 (rescaled, as before, by x e 3 ) to deduce 
from the fact that L and 1/ intersect d e h 2 fL k 2 (E) D B C2t -3/ w (x) that: 

(9.9) (L U L') n B C3t -,(x) C d eh3 ^ 3 (E) , 
where c 3 > is a universal constant, provided that: 

a > 29h 2 + 71{k 2 + 3) = 29h 2 + 71k 2 + 213 , 

(9.10) fc 3<^, 

h 3 > 19h 2 + 46(fc 2 + 3) = 19h 2 + A<6k 2 + 138 . 

Now, as in the non-quantitative proof, let Y = Y(L, V) C M 2 be the hyperbola from 
Lemma [H79l and let U be the component of M 2 \ Y which intersects both branches of Y. Let 
c 4 be a small enough constant, and take a point q G £7 such that B C4e 3(q) n R 2 C Z7 and 
rf H (g, 7r(L) Pi 7r(L')) < 10e 3 . As in the proof of substep 3) of the classification of monotone 
sets, using Lemma I83| any point z G B C4e 3(q) flM 2 is of the form z = tt(L*) R 7r(L**), where, 
provided C4 is small enough, L*,L** are skew lines which intersect L fl S cge -3(x) and V fl 
5 C3e -3(x), both of which are contained in d e h 3 e k 3 (E). Another application of Corollary 17.411 
implies that: 

(9.11) (L* U L**) n B C5e - :i (x) C d eh4>ek4 (E) , 
provided that: 

a > 29h 3 + 71(jfe 3 + 3) = 29h 3 + 71k 3 + 213 , 

(9.12) k 4 < 



k 3 + 3 



4 1 

h 4 > I9h 3 + A6(k 3 + 3) = 19h 3 + AQk 3 + 138 . 

We continue to argue as is substep 3). We use Corollary 18.141 to deduce that z = 
m (L*, L**), where (once more, provided C4 is small enough), L* ,L** are parallel lines which 
intersect the line segments L* fl B C5e -3(x) and L** fl -B C5£ -3(x), both of which are contained 
111 (E). So, another application of Corollary 17.411 gives: 

(9.13) (L* U L** ) fl 5 C6e -3 (x) C 9 £h5 j£fc5 (E) , 
provided that: 

(9.14) k 5 < 



a > 29h 4 + 71(k 4 + 3) = 29h 4 + 71k 4 + 213 
A; 4 + 3 



4 ' 

h 5 > 19h 4 + 46(k 4 + 3) = 19h 4 + 46k 4 + 138 . 

By Lemma 18.31 any point on the fiber Tt~ l (z) lies on a line which touches both of 
L*,L**. Moreover (using the explicit formula for this line, which is contained in the proof 
of Lemma f8 . 3 j) . since the vertical separation between L and L' is > e 3 , and z is an arbitrary 
point in B Cie a(q) fl IR 2 , a (final) application of Corollary 17.411 together with the ball-box 
principle, shows that there exists pel such that: 

(9.15) B r (p) cd £h6iek6 (E)nB e - 3 (x), 
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where r x e 3 , provided that: 

a > 29h 5 + 71(Jfe 5 + 3) = 29h 5 + 71k 5 + 213 

(9.16) k 6 < 



h + 3 



4 ' 

h e > 19h 5 + 46(k 5 + 3) = 19h 5 + 46A; 5 + 138 . 

Now, by choosing k\ sufficiently large, we will use (19 . 1 5[) to contradict Lemma [7^81 provided 
k\ is large enough. Modulo the proofs of technical lemmas which were postponed to the 
following sections, this will conclude the proof of Theorem 14.131 Before doing so, note that 
for the conditions (19"771) . (BTTDjl . (19~T2D . (19~T4D . (l9~T6l) to be satisfied we can ensure that (l9~T5l) 
is satisfied for, say, a = 2 i0 ki, k 6 = 2~ 10 A; 1 , h 6 = 2 30 A: 1 . (In actuality, these are big over- 
estimates.) 

It remains to show how to choose k\ so that (19.151) will contradict Lemma 17.81 Since 
E is e a -monotone on B e ~3(x), it is also x e a_24 -monotone on B r (p) (recall that r x e 3 ). 
By Lemma EH we can find p' G B r / 2 (p) such that p' ^ d a>r '(E), where a x e( a_24 )/ 2 and 
r' x r x e 3 . Without loss of generality assume that p' G mt a y(E), so that 

(9.17) C 3 (E' n B r/ (p')) < a£ 3 (B r ,(p)) * ef- 12 e 12 = e a / 2 . 
But, at the same time, p' G d e h 6 ^k 6 (E), which means that 

(9.18) C 3 (E' n 5 efc6 (p')) > e^^a (5 efc6 (p')) - ^ 6+4fc6 . 

In order for ( 19.181) to contradict ( 19.171) assume that e fce > r', which would hold for small 
enough e if k 6 = Jnj > 3. In this case B e k 6 (p') C B r >(p'), and the desired contradiction would 
follow (for small enough e) if | > h e + 4k e . Choosing k\ = 2 12 , and thus a = 2 52 and h e = 2 42 
yields the required contradiction, and completes the proof of Theorem 14.131 □ 



10. NONDEGENERACY OF THE INITIAL CONFIGURATION. 

In this section, we prove the assertion on nondegeneracy of the initial configuration, which 
was used in the proof of substep 3') given in Section [9j see the four items marked with bullets 
in paragraph following (I9.8p . Specifically under the assumptions of Lemma [10.31 below, we 
show that at distance < e -3 from x we can find a pair of skew lines in the controlled 
quantitative boundary which make standard angle as in (110.521) . and have separation > e 3 ; 
see flT03|) . (110531) . 

Given a pair of (unordered) skew lines L, L, define < 6(L, L) < tt to be the angle 
between 7r(L) and 7r(L) which faces the sector which contains the hyperbola Y(L,L) from 
Lemma [8]9j The vertical distance d v (L,L) is defined as in (18.21) . As in (18.151) . denote by 
b(L, L) the union of the (two) shortest horizontal segments joining L and L, and denote the 
length of each of these segments by ft(L, L). Then with the geometric interpretation of the 
multiplication in HI, we get 

(10.1) \ cot(i*(L, L)) ■ (9(L, Z)) 2 = (d v (L, L)f . 
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Since tan (f ) > f or a n q e [o, 7r ] j it follows from ffTOTD that 
(10.2) (*(L,L))* >sin9(L,L) ■ (d v (L,L)f. 

In what follows, for A C HI and e > 0, the closed e-tubular neighborhood of A is denoted 



T e (A) = {x G H : d (x, A) < e}. The required nondegeneracy of the initial configuration, 
namely the existence of the "quantitatively skew" lines L, L' that are described in the para- 
graph preceding (19.91) . is a consequence of the following Lemma flO. 31 The hypothesis ( 110.41) 
below corresponds to the assumption concerning 2-planes that we arrived at in (19.61) . and 
the discussion preceding (19. 8ft . Recall that T e (-) denotes the closed e-tubular neighborhood, 
as defined in ( 19.41) . 

Lemma 10.3. Fix x G HI, U C Bi{x) and e G (0, 1). For all y G U , fix an open set O y of 
lines through the origin of H y . Assume that for all 2-planes P we have: 



(10.4) U<£T e (P). 

Then there exist y,z G U and L y G O y , L z G O z , such that L y , L z are skew and: 

(10.5) (T)(L y , L z )f > sin (6(L y , L z )) ■ (d v (L y , L z )f > e 6 . 
yes 

Proof. Denote the Euclidean distance in M? by d( ■ , • ). Fix y\ G U. Let V denote the unique 
vertical 2-plane containing L yi . It follows from (j!0.4p . with P = V, that there exists yi G U 
such that 

(10.6) d(7c(L yi ),7T(y 2 ))>e. 

Let H = H q denote the unique horizontal 2-plane containing the line, L yi , and the point 
V2 $ Lyi- Note that q G L yi . By ( 110.41) . there exists y 3 , such that 

(10.7) d m (y 3 ,H q )>e. 
For arbitrary q, y 3 G H, we have 

(10.8) d m (q,H m )=d m (y 3 ,H q ). 

To see ( 110.81) . note that by left-invariance of d u , the left-hand side equals d n (y 3 1 ■ q, if( ,o,o))> 
and the right-hand side equals ^(g -1 ■ 2/3,^(0,0,0))- If 2 G #(0,0,0) is closest to y 3 1 • q, then 
z^ 1 G if(ooo); an d by (12.181) . ^((ys) -1 • q,z) = d m (q~ 1 ■ y 3 ,z~ 1 ). By symmetry, this gives 

flEEZJ. 

Since d m (q,L y3 ) > d m (q,H m ), from (fTrTTj) . (flTO . we get 

(10.9) d m (q,L m )>e. 

Since O yi , O y2 , O m are open, we can assume that no two of L yi , L y2 , L m are parallel. By 
applying a translation we may also assume that q = (0, 0, 0). Thus, after applying a suitable 
rotation we may assume that L yi is the x-axis and y\ = (a, 0, 0),y 2 — (b, c, 0) for some 
a, b, c G JR.. Set L yi PI vr(L y2 ) = n 3 = (s, 0, 0) and let a G (0, 7r) denote the angle between L yi 
and ir(L m ). 
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It follows from (110. 6p that |c| > e. Since \a — b\ + \c\ + \/\ac\ x d m (yi, y 2 ) < 2, we have: 
(10.10) |o|<- and \a - b\ < 1. 



Note that 

C C 6 

(10.11) sina = ——. — x r > — — 

d{-K{y 2 ),n 3 ) c+\b-s\ e+|6| + |s|' 

and by the geometric interpretation of the multiplication, 

(10.12) (d v (L yi ,L y2 )) 2 = \c\ ■ \s\ > e\s\. 

It follows by multiplying together (110.111) and fl 10 . 1 2[) that we may assume that e+ |6| > \s\, 
since otherwise (110.51) holds with e 6 replaced by e 2 . Using (110. 10p we deduce that \s\ < 
and therefore it follows from (110. lip that 

(10.13) sina>e 2 . 
Using (110.121) and (110.131) that we may assume that 

(10.14) d(n 3 ,q) = \s\<e\ 

since otherwise (110.51) holds. 

Let S denote the triangle with vertices 7i(L y2 )n7i(L y3 ) = rii, L yi n7r(L y3 ) = n 2 and n 3 . Let 
atj denote the angle at the vertex rij (thus a 3 is either a or n — a). Put £j = d(rij + i, nj +2 ). 
(Here and in the rest of the proof of Lemma [10.31 indices are taken mod 3.) It follows from 
(110.131) that there exists j G {1, 2}, such that 

(10.15) sinc^^e 2 . 

Write {k} = {1, 2} \ {j}. From (TT0l3"|l . ([10715]) . we may assume that 

(10.16) (d v (L yi ,L y2 )) 2 <e\ 

(10-17) (d v (L yk ,L m )) 2 <e\ 

since otherwise (110.51) holds. 

From the geometric interpretation of the multiplication, it follows that there exists r e 
{1, 2, 3} such that 

2 1 

(10.18) (d (L r+1 , L r+2 )) > area(S') = sin a 3 . 

By multiplying both sides of (110. 18j) by sin a r and using the law of sines to express sin a r in 
terms of sin a 3 , we obtain with (110.131) and the triangle inequality, 

(10.19) (d v (L r+1 ,L r+2 )) 2 sma r >(sma 3 ) 2 -mm{il£l} > e 4 • min {ij, i\] . 
Arguing similarly with (110.151) . we have: 

(10.20) (d v (L r+u L r+2 )) 2 sma r >e 4 -mm{£l£ 2 3 } . 
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We may therefore assume that min {£\ } £ 2 } < ^ and min {£\ } £^} < since otherwise (110.51) 
holds. By the triangle inequality £k < £j + £3, and therefore 

(10.21) £ k <\t. 

By adding (1X0141) . (flTUHll . (fiTTTTll . (TiTOl! . we get d M {L y3 ,q) < ±e + 0(e 2 ), which contra- 
dicts ( 110. 9p for e small enough. This contradiction completes the proof. □ 

In deriving the consequences of Lemma 110. 3[ we will work in coordinates. Thus, we write 
Li, L 2 for L y ,L z . As in (18.101) . we can canonically assume that: 

Li = (t, wt, c), L 2 = (s, —ws, — c) . 

As in (18. lip , the angle < 9(L 1 ,L 2 ) < n is determined by tan (~0(Li, L 2 )) = w. Let 
6 + (L,L), 6_(L,L) contain the shortest line joining L%, L 2 ; see (18.151) . Thus, 

and so, 

(10.22) d m (b + (L u L 2 ), b.(L u La)) = 2^ 
Put: 

(10.23) e it± (L u L 2 ) = U n 6±(L 1; L 2 ) , 



or equivalently, 
(10.24) 



ei )+ (Li,L 2 ) = ( \/^,y/cw,c 



e 2 ,+(Li,L 2 ) = ( J —,-y/cw,-c 



Then with the geometric interpretation of the multiplication in HI, (<i y (Li, L 2 )) 2 is twice the 
area of the triangle with vertices vr(Li) R vr(L 2 ), 7r(ei i+ (Li, L 2 )), 7r(e 2)+ (Li, L 2 )). It follows 
that (d y (Li,L 2 )) 2 = 2c. Note that 

5(L l5 L 2 ) = d m ((e 1: ±(L u L 2 ), e 2j± (L 1 , L 2 )) . 

Therefore, 

(10.25) f(L!,L 2 ) =2Vcw. 
Equation ( 110. ip now becomes: 

(10.26) (0(L 1; L 2 )) 2 = 2 W • (d v {L 1 , L 2 )) 2 . 

Corollary 110.271 below will be derived directly from Lemma 110.31 without further reference 
to the assumption on 2-planes. The proof will rely on the following additional information. 
Write oi = y and o 2 = z. Since oi,o 2 6 Bi(x), we have d m (oi,o 2 ) < 2, which implies 
d v (oi,o 2 ) < 2, d(7r(oi),7r(o2)) < 2. 
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We put 

02 = (S 2 , ~WS 2 , -C) , 

where without loss of generality, we can assume either case i), t\ > 0,s 2 < 0, or case ii), 
h >0,s 2 > 0. 

Below, to avoid confusion, we will write Q for the constant c, in (110. 5p . We also assume 
e < 1. 

Corollary 10.27. The following bound holds true: 

(10-28) c .^_> e 6. 

1 + w z 

Moreover, 

i) Ifh >0, Sl < 0, 

(10.29) f 6 < w < 



(10.30) e 3 v / TT^<D(L 1 ,L 2 ) < v^. 

(10.31) d m ( 0l ,e 1>+ (L h L 2 ))< 1 



, w 

(10-32) e 6 < c < 1 . 

(10.33) |ti| + |s 2 |<l. 

(10.34) \h + s 2 \ <-. 

w 

(10.35) w|*i|-|s 2 | <1- 
ii) Ifti >0,s 2 > 0, 

(10.36) w<e~ 6 . 

(10.37) eVl + w 2 < L 2 ) < V / TT^ . 

(10.38) rf H (o 1 ,e 1 , + (L 1 ,L 2 ))<e- 3 . 

1 



(10-39) 6 b < c < 1 + 



w 



(10.40) |ti-s 2 |<l. 

1 



(10.41) h + s 2 < 



w 



(10.42) |c-wt lS2 |<l. 
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Proof. Since sin0(Li, L 2 ) = 2 • ^)) 2 = 2c, ffT0T28l) is just a rewriting of (fT03|) . 

The lower bounds in (jflTSQI) . ffTOlJTl) . follow from (flO^gl) and ffT0T2"5l) . 

Next note that since d m (oi, 02) < 2, from the box-ball principle we have 



(10.43) |ti - s 2 \ + w\ti + s 2 \ + ^\2c - 2wt lS2 \ < 1 . 

Case i). Since t\ > 0, s 2 < implies that in the third term on the right-hand side of (110.431) 
we have — wtis 2 > 0, we get c < 1, which gives the upper bound in (I10.32p and since 
D(Li,L 2 ) = the upper bound in (I10.29[) as well. Using c < 1 and multiplying both 

sides of ( TT0~28D by ±±f^, gives ffTtEZil . Relations (JTESP-fllEED follow immediately from 
flELlHD and ti > 0, s 2 W < 0. 

To prove (1 1 . 3 1 j) . by considering the cases, w < 1, w > 1, and using the box ball principle, 
(110.431) . it suffices to show that 



W ~ \/W 



Since c < 1 we get < From fll0.33[) we get ti < 1, which gives the case w < 1. For 
w > 1, relations ( 110.34[) . ( 110.35H imply < which completes the proof. 

Case ii). Relation fll0.39[) was already shown in the proof of case i). By using ti > 0, s 2 > 0, 
relations fll0.40p - fjl0.42p follow directly from (jTD^5|> . From f|10.4ip . 

c^l+wt^^l + w-ih + s^ 2 , 

which, by (110.411) . gives the upper bound for c in (I10.39p . From this, the upper bound in 
(110.371) follows immediately, as does the implication c < 5 if w > 1. This, together with 
(110. 28[) . gives the upper bound for w in (I10.36j) . 

To prove (110.381) . note that from (110. 42j) . by dividing through by w and factoring, we get, 

1 



c . 

Vhs 2 

w 



VhS2 

w 



< 



w 



which, together with f 1 1 . 4 1 [) . gives 



:i+w) 



- - 1 1 

w 



< (l + w )-— + (l + w ) \^s 2 -t l 



wc 



From fll0.40p . in case w < 1, and (I10.4ip . in case w > 1, it follows that the second term on 
the right-hand side is < 1. By using the lower bound in (110.301) to bound the first term on 
the right-hand side, the proof is complete. □ 

In the next corollary, by considering separately the cases < w < 1, 1 < w < 00, we 
obtain estimates which depend only on e. 

Corollary 10.44. 

1) If0<w< 1, then 

(10.45) e 3 < d(L u L 2 ) <1, 
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(10.46) C / H (o 1 ,e 1 , + (L 1 ,L 2 ))<6- 3 . 
2) If 1 < w < oo, 

(10.47) e 3 < d m (b + (Li, L 2 ), b_(L 1 , L 2 )) < 1 

(10.48) e 3 <X>(L u L 2 ) < e ~ 3 . 

(10.49) rf H ( 0l ,e 1 , + (L 1 ,L 2 ))<e- 3 . 



Proof. Relation (I10.47p . i.e., the bound on 2y^, for w > 1, is a direct consequence of 
(110.291) . (110. 32p . (110.361) . (110. 39p . The remaining relations can be read off from Corollary 
[T03T1 □ 

If w > 1, as a consequence of (110.471) we obtain a pair of parallel lines b±(Li,L 2 ) whose 
separation, 2y^, is bounded below by > e 3 and above by a universal constant, and with a 
transversal Li, such that (I10.48p . (110. 49|) hold. This will suffice for our application. 



If w < 1, although 2y^j is not bounded above, 0(L 1; L 2 ) is bounded above by a universal 
constant, and below by > e 3 , and oF(oi, ei i+ (Li, L 2 )) is bounded by a definite multiple of e~ 3 . 
In this case we obtain a pair of skew lines making standard angle and controlled separation 
as follows. 



Let 



VT 



w 



2 



W 

denote the distance from the origin in M 2 to (a/-^, ±y/cw)) = 7r(ei ) ±(Li, L 2 )). The points 



— , y/CW 



*(L u L 2 ) + r VV^' V ; ' 
lie on the projection of a line, L G lines(if), with £j = L fl Li, where 

<F(z 1 ,e 1)+ (L 1 ,L 2 ))=*(L 1 ,L 2 ), 

d m (z 2 , e 2 , + (L 1; L 2 )) = / • D L 2 ) . 

r + 0(Li,L 2 ) 

By direct computation, the slope m(L) of 7r(L) satisfies 

(10.50) -<m{L)<-. 

3 2 

From this we get: 

Corollary 10.51. The lines L, b + (L\,L 2 ) are skew. The point tt(L) fl n(b + (Li, L 2 )) lies on 
the line segment from 7r(ei ;+ (Li, L 2 )) to 7r(e 2j+ (Li, L 2 )). Moreover, 

(10.52) l<t a n6(L,b + (L u L 2 )) < ^, 

5 2 
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and 

(10.53) J| • 0(Li, L 2 ) < 6+(L 1; L 2 )) < L 2 ) . 

Proof. Relation (110.521) follows directly from (110. 50[) . By the discussion above, the tri- 
angle with vertices, (0, 0), 7r(ei i+ (Li, L 2 )), 7r(e 2i+ (Li, L 2 )), and the triangle with vertices 
(0, 0), n(zi), tt(z 2 ), have equal areas. It follows that the triangles with vertices, 7r(ei i+ (Li, L 2 )), 
7r(zi),7r(L) n 7r(6 + (Li,L 2 ), 7r(e 2j+ (Li,L 2 )),7r(z 2 ),7r(L) n tt(6 + (Li, L 2 ), (whose union is the 
symmetric difference of the previous ones) have equal well. From this and the 

geometric interpretation of multiplication, we get (110. 53j) . □ 



11. Proof of Lemma [TT81 

In this section we prove Lemma 17.81 We begin with some measure theoretic preliminaries 
which play a role here and in Section [T2J Then we give the proof for case of precisely 
monotone sets (for which the preliminaries are not required) ; compare also the proof in [23J . 
Finally, show how to modify the argument to obtain the general case of Lemma 17.81 

Measure theoretic preliminaries. Recall that £3 denotes Lebesgue measure on 1 = I 3 
which is a Haar measure of EL Given L G lines (H) we denote by 7i\ the Hausdorff measure 
induced by the metric d m on L. Recall that M denotes the unique left invariant invariant 
measure on lines(H) which is normalized so that jV(lines(Si(p))) = 1. Given L G lines (H) we 
denote by [L] its Heisenberg parallelism class, i.e., [L] = {gL : g G H}. HI acts transitively 
on [L]. Therefore there exists a left invariant measure //rn on [L], normalized so that for 
every measurable icHwe have 

(11.1) / H l L ,(L'nA)d m (L') = jC 3 (A). 

J[L] 

The space of all parallelism classes will be denoted ^3 = {[L] : L G lines (H)}. Each such 
parallelism class [L] is uniquely determined by an angle 9 G [0, 2tt), corresponding to the 
angle of 7r(L) C M 2 (recall that the lines L G lines(H) are oriented). We thus there is an 
induced measure d9 on *p which corresponds to the standard measure on the circle S 1 . By 
uniqueness there exists a constant c > such that for all integrable / : lines (Et) — > M we 
have: 

(11.2) / fdAf = c[([ f(L')d m (L')) dd{[L]). 

Jlines(H) J \J L'£[L] J 

Let P C lines(H) x H denote the space of oriented pointed lines, i.e., P = {(L, x) : x G L}. 
We shall use below the measure v on P which is defined by setting for every compactly 
supported continuous / : P — ► M: 

(11.3) ffdv=[ ( f f(L,x)dH l L (x)) dAf(L). 

JF Jlincs(H) \JL J 
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In the proof of Lemma [7751 we shall use the space of configurations C, where a configuration 
is a quadruple (L,xi,x 2 , [L']), where L G lines (H), xi,x 2 G L, and [L'} G (i.e., a doubly 
pointed line and a parallelism class) . C carries two measures o\ , a 2 , which are defined as 
follows. Given / : C — > M which is continuous and compactly supported let: 
(11.4) 

ffda 1 =f [ ([ f(L,x 1 ,x 2 ,[L'])d(n 1 L xn 1 L )(x 1 ,x 2 ))d6([L'])dAf(L), 

JC Jlines(H) J <p \J LxL J 

(11.5) f fda 2 

f(L(x),x,L(x)nL 2 ,[L'])dn 1 Ll (x))d(ix [L/] x» lLI] )(L u L 2 )) d6([L']), 



c 



'<P \J[L']x[L'] \JLx 

where in ( lll.5p . as in Sectional given two parallel lines Li,L 2 G [L'\ with distinct projections 
and x G Li, L(x) denotes the unique element of lines (M) which passes through x and 
intersects L 2 . (In Section [HI L(x) D L 2 was denoted x*). 

Each of 01,(72 is absolutely continuous with respect to the other. Moreover, given a 
compact subset of the Heisenberg group K C EI and a > 0, let C(if, a) denote the set of 
configurations (L,x\,x 2 , [L']) G C with Xi,X2 G if and d (^(xiL^O) -1 ^'), n(x 2 L' {Q)' 1 !^,')) > 
a, i.e., the unique lines in [L'} which pass through x%,x 2 have projections of distance at 
least a, and x\,x 2 are in the compact set K. One checks from the definitions that on the 
compact set C(K,a), the Radon-Nikodym derivatives ^ are continuous. Hence for 
every measurable A C C(if, a), we have 

(11.6) <T X (A) -a 2 (A), 

where the implied constants depend only on K and a. This observation will be used in the 
proof of Lemma 17.81 below. 

A consequence of small nonconvexity. Recall that the nonconvexity NCb t (x){E, L) was 
defined in (14.71) . The following consequence of small nonconvexity will be used in the present 
section and repeatedly in Section [12] (where the main result concerns 5-convex sets, which 
are more general than 5-montone sets). 

Lemma 11.7. Fix p G H and E CM. Let L G lines (H) satisfy NC Bl ( P )(E, L) < 5. Assume 
that [c, d] C [a, b] C L H Bi{p) and 

(11.8) Hl([a,c]nE)> 5, H\{[d, b] n E) > 5 , 

then 

n 1 L ([c,d\nE f ) <5. 

Proof. For all sufficiently small 77 > 0, we have Hi{[a, c]f)E) >S + 7], Hl([d, b}nE) >5 + rj. 
If / C L H Biijp) is an interval which exceeds the infimum on the right-hand side of (14.71) 
by at most rj/2, then the intersection of I with both [a, c] and [d, b] must have positive 7i\ 
measure. Thus, I = [e, /] D [c, d]. By the choice of I, we also have TC\([e, f] HE') < 5 + rj/2, 
which implies 7"^([c, d] fl -E") < 5 + rj/2. Letting rj tend to completes the proof. □ 
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The case of precisely monotone sets. We call a set E C B\(p) precisely monotone 
if E fl L and E' f] L are connected for all L G lines (H). Therefore, given such a pair 
(£", L), either L fl £>i(j?) C E, L fl -Bi(p) C -E', or there exists a unique G L such that 
(L \ {qL,E}) H -Bi(p) consists of two open intervals, one of which is contained in E and the 
other of which is contained in E' . 

Choose a pair of parallel lines, Li,L 2 with distinct projections and recall that X = 
X(Li,L 2 ) denotes the ruled surface which is the union of lines, L(x), passing through L\ 
and L 2 . Here, {2} = L(x) fl L\. Note that H acts on Li, L 2 , L 2 ) by left translation. 

Lemma 11.9. Fixp G HI. There is a left-invariant function defined on pairs L 1; L 2 of parallel 
lines with distinct projections and taking values in 6-tuples of relatively open subsets of the 
surface X(Li,L 2 ) fl Bi(p), {Wj(Li, L 2 )} 6 j =1 , such that if E is a precisely monotone subset 
ofB 1 (p), then for some j{L u L 2 , E) G {1,2,3,4,5,6}, W(L U L 2 ,E) : = W j(LlM , E )(Li, L 2 ) 
consists entirely of points of E, or entirely of points of E' . 

Proof. We can assume that say L\ = (t,b, —bt), L 2 = (s, —b, —bs), with | < b < 1. Then, 
in the notation of Lemma 18.31 for x = (t,b,—bt) 6 L\, we have x* = (—t,—b,bt), L(x) = 
(r, r/b, —bt). 

Let Ii, I 2 , 13, denote the intervals, (0, |), (|, |), (|, |), respectively, and let —Ij be defined 
by t E —Ij if and only —t G Ij. Define W / i(L 1 ,L 2 ) to be Bi(p) intersected with the union 
of those segments of the lines L(x) which join x G -^i(Zi) to x* G L 2 (— 1\). Also, define 
W / 2(Li,L 2 ) to be Bi(p) intersected with the union of those subrays of the lines L(x) for 
which x G Li(ii), x* G L 2 (— whose endpoint is x and which are disjoint to the segment 
[x, x*]. Define W${L\, L 2 ), W±{L\, L 2 ) and Ws(Li, L 2 ), We(Li, L 2 ) analogously, corresponding 
to j = 2, 3, respectively. 

There exists some i(Li, L 2 , E) G {1,2,3}, such that qEM & Li(Ii) and qE,L 2 ^ Lzi—h) 
(where possibly, one or both of ^em^^M d° n °t exist altogether). 

Fix i = i(Li, L 2 , E) as above. There are four possibilities: 

la) Li(Ji) C E, Lx(-Ii) c E, 
lb) L^cE', Lxi-I^cE', 
2a) L^Ii) C E, L 2 (-Ii) C E>, 
2b) Lx^cE', L 2 (-h)cE. 

In cases la), lb), we can take j(Li, L 2 , E) — 2i — 1, while in cases 2a), 2b), we can take 
j(L 1 ,L 2 ,Bf) = 2i. - ^ 

Take a small interval [0, d] in the center of H and consider all left translates g(X(Li, L 2 )) = 
X(g(Li), g(L 2 )), where g G [0, d). Clearly, for some subset S = S(Li, L 2 , E) C [0, d], of 
measure > A, if g G S, then the integers j(g(Li), g(L 2 ), E) will all coincide and in addition, 
either for all g G 3, W(g(L 1 ), g(L 2 ), E) C E, or for all g G S, W(g(Lx) : g(L 2 ) : E) C E' . 
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Without essential loss of generality, we can assume G S and say W(Li, L 2 , E) C E. Hence, 
W(g(Li),g(L 2 ), E) = g(W(L u L 2 , E)) C E, for all g G S. 

Choose an interior point y G W(Li, L 2 , E) fl Bi(p), lying at a definite distance from the 
boundary, choose a line L(x) such that y G L(x), and choose L G lines (H) making a definite 
angle with L(x). Let [L] denote the collection of lines parallel to L. By Lemma 18.31 (and 
continuity) we can assume that d has been chosen so small that if g G [0, d], Li G [L] and 
L' fl g(X(Li, L 2 )) fl Biodiy) 7^ 0, then L' intersects g(X(Li, L 2 )) transversely. Parameterize 
each such V such that L'(0) — L' D W(L\, L 2 , E). Let J denote the smallest interval 
containing S. Then the length of J is > ^ and by monotonicity of 22, we have 27(J) C E. 
This completes the proof in the precisely monotone case. 

Proof of Lemma 17.81 Suppose now that E C Bi(p) is e 2 -monotone on Bi(p). Choose 
a measurable mapping L i— > 2^ C L (e.g., via an application of the measurable selection 
theorem in [H]) such that Il,e H Bi(p), (Il,e)' H 2?i(p) are intervals and for almost all L, 

(11.10) Wi(J i)E A (22 n L n S!(p)) < 2 • NM Bl(p) (L, 22) . 

Let denote the common boundary point of Il,e, {Il,e)'- 

Define Ii,I 2 ,I 3 as above. Given a pair Li,L 2 of parallel lines with distinct projections, 
define i(Li, L 2 , 22) as above, but replacing q LjE by Then, mutatis mutandis, define 

cases la)-2b) and {Wj(L 1 , L 2 )}® =1 , j(L 1 , L 2 , E),W(Li, L 2 , E) as above. 

In the Claim below, Center (H) is equipped with its natural measure C and W{L\, L 2 ,E) C 
X(Lx,L 2 ) is equipped with the natural surface measure M. on X(Li,L 2 ). 

Claim. There exists a universal constant c > with the following properties. Assume that 
22 C Bi{p) is e 2 -monotone on B\(jp). Then there exists a pair of parallel lines L\,L 2 whose 
projections lie at distance > c, such that for a fraction > c of g G Center (H) fl Bi(p), the 
surface W(p(Li), g(L 2 ), 22) has measure > c and, apart from a subset of measure < e/c, it 
consists either entirely of points of 22 or entirely of points in 22'. 

Assume provisionally, that the Claim holds. Since 22 C Bi(p) is e 2 -monotone, by (111.21) 
with f(L) = NMflj (p)(22, L) and Markov's inequality (13.151) . we can choose a parallelism class 
[L] of lines which are all a definite amount transverse to X(Li,L 2 ) (i.e., the angle between 
these lines and the surface X(Li,L 2 ) is larger than a universal constant) such that apart 
from a subset of measure < e of lines in [L], the remaining lines are all < e-monotone. From 
this, together the Claim and Lemma [1 1 . 7\ we directly obtain Lemma [7.81 

Proof of the Claim. Rather than studying the space of surfaces X(Li,L 2 ) directly, it is ad- 
vantageous to decompose each such X(Li,L 2 ) into its collection of ruling lines, L, and 
then to lift considerations to space of all triples (L±, L 2 , L), where a triple (Li,L 2 ,L) is 
a set of lines Li,L 2 ,L G lines(2?i(p)), such that Li,L 2 are parallel in the Heisenberg 
sense, and L\ fl L = x±, L 2 fl L = x 2 are nonempty and lie in Bi(p). Recall the set 
of configurations C that was defined in the paragraph following flll.3p . Each configura- 
tion (L,xi,x 2 ,[L'\) G C with Xi,x 2 G B\(p) determines, and is determined by, the triple 
(Li, L 2 , L), where L x = xiL'ify^L' and L 2 = x 2 2/ / (0) _1 L / (thus using previous notation, 
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L = L(xi) and x 2 = x\). Let C\ denote the set of configurations (L, xi,x 2 , [L']) G C with 
xi,x 2 G Bi(p). 

Definition 11.11. A pointed line {L,x) is called consistent if one of the following two 
options holds true: either x G Il,e an d x G E, or x G (Il,e)' H ^(p) and x E E' . A 
configuration (L, x%, x 2 , [L']) is called consistent if (L 1? xi), (L 2 ,x 2 ), (L,x 2 ) are all 

consistent, where L\ = xiL'^O)^ 1 ^ and L 2 = x 2 L'(0)~ 1 L'. Such a configuration is called 
e-monotone on B\{p) with respect to i£ if Li, L 2 , L are all e-monotone on -Bi(p) with respect 
to E. 



Let G e denote the set of configurations which are consistent and e-monotone on Bi (p) and 
let (G e )' = Ci \ G e . By using Lemma Til. 7\ it is immediate to verify from the definitions that 
if for some parallel lines L 1; L 2 G lines(i?i(p)) we have: 

(11.12) ?4 (x G L x (/, (il)i2jB) ) : (L(x),x,x*, [L x ]) G (C e )') < e, 

then apart from a set of measure < e, the set W(Li, L 2 , E) consists either entirely of points 
of E or entirely of points in E' . 

Lemma 11.13. If E is e 2 -monotone on B\(p) then: 

(11-14) ^((Gyo^e, 

where o~\ is defined as in flll.4j) . 

Proof. We use the notation introduced in our discussion of measure theoretical preliminaries 
at the beginning of this section. Fix L G lines (Bi(p)) and let G'(L,E) denote the set of 
x G L D B\(p) such that (L,x) is not consistent. By definition, we have 

(11.15) nl(G'(L,E)) <2-NM Bl(p) (L,E), 



so 



(11.16) / Hl(G'(L, E)) dAf(L) < 2 • NM fll(p) (£) < e 2 . 

Jlines(Bi(p)) 

For x G Bi(p), denote by m(x) measure (with respect to d6) of the set of those [L] G V 
such that for the unique V G [L] such that x G L', the pointed line (L',x) is not consistent. 
Then by ffTTTTj) and flTO) we deduce from (TiTTBD that 

(11.17) / m(x)dC 3 (x) <e 2 . 

yBi(p) 

For xi,x 2 G L, define F(xi,x 2 ) by 

(11.18) F(xi,x 2 ) = m{x x ) + m(x 2 ) + X/i, B A(BninBi(p))(^i) + X/ L , E A(£nLniMp))( x 2) • 
Then, 



;il.l9) / (I F{ Xl , x 2 )d (Hi x Hi) (x u x 2 )) dAf(L) 

Jlines(Bi(p)) V J x 1 ,x 2 &LnB 1 (p) / 

l lll. 171 1 r mi. ion 

< e 2 +/ H\(I l ,e A (E n L n Bi(jp))dN(L) < e 2 + NM fll(p) (£) < e 2 

J lines (Bi (V)) 



'lines(Bi(p)) 
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It follows from fl 11 . 1 9 [) and (111.41) that if we denote by A C C the set of configurations 
(L,xi,X2, [L 1 ]) which are not consistent then <yi(A) < e 2 . 

Let B C C denote the set of configurations (L,xi,x 2 , [L']) which are not e-monotone on 
Bi{p) with respect to E. This implies that for iL,x\,x 2 , [L'\) £ B we have: 

NM Bl(p) (L,E) + NM Bl(p) (i 1 L'(0)- 1 L',£) + NM Bl(p) (x 2 L'(0)- 1 L',£) > e. 

Hence, using Markov's inequality (13.151) . combined with the identities (111.4j) . (111.21) . we 
deduce that 

gwg < f 

e 

Since, by definition, o"i((G e )') < 0\iA) + ai(-B), the proof of (111.141) is complete. □ 

Set a = and recall that, as defined before (lll.6p . that C = C(£>i(p), a) is the set of all 
configurations (L, x±, x 2 , [L 1 ]) £ C such that xi,x 2 £ -Bi(p) and the projections of the parallel 
lines xiL '(O) -1 V ', x 2 L'iO)~~ 1 L' have distance at least a. It follows from (111.61) and ( 111.141) that 

02 (^iG e ) f fl C j < e, where the measure 02 is as in ( 111.5p . It follows from ( 111.5|) that there 

exists a parallelism class \L'] £ *p such that if we define 

F = {L u L 2 e [L'\ : d(7r(Li), tt(L 2 )) > a} , 

then 

(11.20) / ^i i (x£L 1 n J B 1 (p) : (L(x),x,x*,[L , ])£(G e )')^(^']Xm'])(^i^2)<e. 

Partition the set of all (L 1; L 2 ) £ Fx F into equivalence classes, where (L 1; L 2 ) is equivalent 
to (L^Lj) if there exists g £ Center(HI) such that L[ = <?(Li),L 2 = 5 , (L 2 ) We deduce 
from (I11.20P that there exist Li,L 2 £ F, an index j £ {1,2,3,4,5,6}, and a subset D C 
Center(H) fl -Bi(p) of measure > 1, such that for all p £ D we have jig{Li), giL 2 ), E) = j, 
the surface W{g(Li), giL 2 ), E) has measure > 1, and, by another application of Markov's 
inequality ( 13.151) . 

U\ x (x £ L^Ij) : iL(x),x,x*, [L x ]) £ (G e )') < e. 

The claim now follows from the discussion following (11 1.12ft , and hence the proof of Lemma 
17.81 is complete. □ 

12. Proof of Proposition 17.101 

In this section we will consider both lines in R 2 and their horizontal lifts to H. It will 
be convenient to use notation, which differs somewhat from that employed elsewhere in the 
paper. This change will also prevent an undesirable proliforation of subscripts, which would 
result if we continued to write L (exclusively) for horizontal lines in HI. 

Let 7r : H — > IR 2 denote the canonical map . Given a line segment T\ C M 2 with Ti(0) = 
(0, 0), denote by T\ its unique lift to a segment of a horizontal line in EI emanating from the 
origin. Similarly, consider a once broken line segment t\ U t 2 , i.e., a pair of line segments 
such that the initial point of the line segment r 2 is the final point of T\. Again, we assume 
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Ti(0) = (0, 0). Denote by T\ U t 2 the lift of T\ U t 2 to a continuous broken horizontal line in 
HI = R 3 , emanating from the origin. 



In place of the line L in Proposition 17.101 we will write 7. Since both the statement and 
proof are somewhat technical, for purposes of exposition, we first consider the model case in 
which 7(0) G E, .80(7(1)) C E and E is precisely convex. In this case, we will show that 
for a large enough universal constant C, there exists c > such that B csr ^f{s)) C E, for all 
< s < 1. 

Recall that in Proposition 17.101 we are given parameters K,T],£,r,p G (0, 1) such that p < 
\nr 2 and p < c, where c G (0, 1) will be a small enough universal constant to be determined 
below. For the reader's convenience we recall here the values of 81,62 in Proposition 17.101 
namely (17321) and (1733]) : 

(12.1) Sl = , 

r 

(12.2) 5 2 = cK 6 rj 3 £ 2 p 3 r 6 • 

In the proof we will also use an auxiliary parameter u > 0, which will be given by: 

(12.3) uo = CK 3 r]£r 2 , 

where c > y/c will be an appropriately chosen universal constant, 
namely (I7.12p and AT. 13j) . we have: 



Values of constants. For the reader's convenience we include here a summary of the 
steps of the proof of Proposition 17.101 in which the particular values of the constants enter 
explicitly. For the constraint p < \nr 2 , f TT2~T8l) . (1T23TD . CEMTj) ; for lu, ([12730]) . (fT2738|) . 
( 1127591) : for <5 1; (rT2730|) . (1T235|) : for 5 2 , (TT238D . (TT23T]) . 



Proof of model case. For reasons of exposition, we begin with the proof of the model case 
described above. 

Remark 12.4. Even though the statement of Proposition 17.101 pertains to balls, the multi- 
plicative structure of EI and its relation to the geometry will necessitate the introduction of 
certain product cylinders, where the product structure corresponds to EI = R 2 x K. In fact, 
our argument will show that if a cylinder centered at 7(1) wtih base radius r 2 and height 
r, is contained in E, then a cylinder with base radius > sr 2 and height sr centered at 7(5), 
will be contained in E. Here, both heights are measured with respect to the metric d m . 

Remark 12.5. Let Oq be as in Section UJ see the subsection titled: "The Heisenberg group 
as a PI space". By applying a suitable left translation and action by Og, we can assume 7 is 
the line (s, 0, 0). In the present subsection and in the proof of Proposition 17. 101 we consider 
only points (a, b, c) G -B csr (7(s)) with c > 0. Points with c < are handled by the symmetric 
argument i.e. by interchanging the roles of and ip below. 

Let C R 2 denote angular sector which is the union of rays r such that r(0) = 7(0) = 
(0,0) and r(l) G £> r / 2 (7(l)), where here B r / 2 ( r y(l)) denotes the Euclidean disk in R 2 . Let 
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denote the union of the lifts of rays r C 0. By the convexity of E, it follows that r C 
implies r(s) G E, for < s < 1, where s denotes arc length. To prove the non-quantititative 
version of ( 17.161) . it suffices to show show that for any r C 0, 

(12.6) n-\r(s))nB csr (T(s)) C E. 
Set 

(12.7) A = arctan(r 2 ) , 

and let 0, ip denote the rays in which make an angle A with r. For the sake of proving (112. 6p . 
by applying a suitable rotation, assume that r is the x-axis, so that we have r(s) = (s, 0, 0). 
Choose the parameterization u such that 0(w) = (u, — w 2 ,0), if)(u) = (u,ur 2 ,0). 

Fix s G [0, 1] and let u be the unique value of u such that the line segment x, from 4>(u ) 
to "0(1)) intersects r at t(s ), where x(0) = <f>(uo). Thus 

(12.8) «o = =^_. 

Let 0i denote the segment of from r(0) = 0(0) to 0(ito). Let x denote the segment of 

01 U % which lifts x- Choose the parameterization t such that 

(12.9) x(t) = uo(l, -r 2 , 0) • t ((1, r 2 , 0) - u (l, -r 2 , 0)) , 

where above, as usual, "•" denotes multiplication in H. If we define <io by x(^o) = 7 "(so) ) 
then from (112.91) . it follows that 

(12.10) d = |, 

/ 2s 2 • r 2 

(12.11) x(d )= Uo, 0,-5 

V ^ — s o 

By the assumed convexity of E we have 0(w) C E, for all < u < 1. From (112.91) . for 
t = 1, and the box-ball principle, (12.201) . we can assume that C has been chosen such that 
x(l) G .60(7(1)) G E. Since E is convex, this gives x(d ) G E. Similarly, by (12.201) . for 

2sl ■ r 2 



< v < 



2 -s 



the line (0, 0, —v) ■ x, intersects QC\E and satisfies, (0, 0, — v) ■ x(l) G Bc r (y{l)) C -E. Again 
by convexity of E 1 , we get (0, 0, — f ) • x(d ) e ^ ^ s straightforward to check that this 
implies that B crso (r(so)) C E for some universal constant c > 0, as required. 

Remark 12.12. In our proof of Proposition 17. 10[ it is necessary to replace x by a 1-parameter 
a family of lines x$i t° t> e defined below. To this end, we observe that there is some flexibility 
in the definition of x- First note that from (112.71) . (112. 8p . (112.101) and the law of sines, it 
follows that for some universal C\ > we have 

(12.13) Cl -r 2 <Z(7(s ),x'(d )). 
For all 

(12.14) 0<9<c 2 -r 2 , 
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with C2 > a small constant, let xe denote the horizontal line through x(da) = r (so) 
obtained by rotating x by an angle 6 about the point x{do), in the clockwise direction. The 
constant C can be chosen such that xe intersects -60(7(1)). It will also intersect 6 fl E. 
In fact, if q = g(r(s )) G 5 csor 2(r(s )), then g(xe) intersects both Pi E and _Bo(7(l)). 
It follows that for any q G B csor 2(r(so)), and any xe with 9 as in (112. 14j) . in verifying that 
7r _1 (7r(g)) fl B crso (r(so)) C E, we can use only the lines which are parallel to some xe. 

Intermediate case and proof. Before going to the proof of Proposition 17. 10[ we consider 
a slight generalization of the model case. As before, assume that .60(7(0)) G E and E is 
convex. Assume in addition that #(7(0)) G E for some g, with 

(12.15) rf H (^(7(0)),7(0))<P, 

where recall that we are assuming that p < nr 2 . The box-ball principle (12.201) . together 
with (112.151) . implies that for all s > we have 

(12.16) d u tf(s),gtf(s)) < max {p,^p}. 

Since p < r 2 it follows from (112.161) that provided C is large enough, the assumption 
£>o(7(l)) Q E implies that Bi^g^^l)) C E. Now, a computation like those above shows 
that for all < s < 1 we have 

(12.17) B csr (g^(s)) C E for all s G [0, 1] , 

where c > is a universal constant. It follows from (112. 15j) . the bound p < nr 2 , and (112. 17j) . 
that there exists a constant d > such that 

(12.18) Bi csr (7(s)) C E , for all s G [c'k, 1] . 

Remark 12.19. It is worthwhile to emphasize here that in order to obtain (112.181) . we must 
take p proportional to r 2 (rather than to r); compare also the use cylinders, -B csr 2(r(0)) x 
\—(sr) 2 , (sr) 2 ] in the model case above and below. 



Proof of Proposition \7Ad] The arguments in the model and intermediate cases use 0c£, 



respectively, g(Q) fl .61(7(0)) C E. Since in the general case, our hypotheses are measure 
theoretic, we will employ integral geometry and in particular, the measure theoretical pre- 
liminaries given in Section [TTJ 

The first (and most involved) step is to show the existence of a thickened version § of the 
angular sector, g(O) C E, of the intermediate case: 

(12.20) s=(J*C(e), 

for some g G H, where A is a suitably chosen subset of the center of H such that g({j{0)) G 
£p(7(0)) n E for all ( G A. We think of § as a "thin slab". 

The slices g({Q) of S will be shown to consist almost entirely of points of E. Hence, 
the vertical structure exhibited in (112.201) implies that the mass of E fl § is very evenly 
distributed. In the presence of sufficiently small nonconvexity of E, this compensates for 
the fact that the mass of E fl §, while subject to a definite lower bound depending on r\ (see 
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(112.241) ) might be still be very small, which might otherwise prevent § from serving as an 
adequate substitute for 6. 

The following lemma, summarizes the essential properties of S. Eventually, we will be 
interested in the behavior of the intersection of § with the cylinder (with coordinate descrip- 
tion) 

B csr 2(r(s)) x [-c • (s • r) 2 , c • (s • r) 2 ], 

whose intrinsic height is 2{rs) and whose measure is x (s • r 2 ) 2 • (s ■ r) 2 . This may help to 
explain the right-hand sides of (flT2^ - (fT2T25]) . 

Lemma 12.21. There exists jGi satisfying ( 112.151) and A C Center (H) such that for E> 
as in HEM) : 

a) 

(12.22) § c g (M 2 x /) , 
for some I C [— p 2 ,p 2 ] C Center(H) ; with 

(12.23) £(/)x£p 2 , 

where £ denotes the Hausdorff measure on the center of M, with its induced ^-snowflake 
metric. 

b) S> has a definite thickness: 

(12.24) C(A)>^p 2 . 

c) S> consists almost entirely of points of E, i.e., for all ( G A: 

(12.25) C 2 ((<?( (e) \ (B p (7(0)) U S P (7(1)))) n Bf) < cor 2 . 

Granted Lemma 112.211 the proof of Proposition 17.101 is not difficult to complete. We 
will now give a brief overview of the argument to come. Since E is 52-convex, we can 
chose xe , as in Remark 112.121 such that for a fraction very close to 1 of lines L G , the 
nonconvexity, NCb Cj .(^(o)){E, L) is very small. By a quantitative version of the proofs of 
previously considered model and intermediate cases, it follows that such lines consist almost 
entirely of points of E. The argument uses (I12.24p . (112.251) . and Lemma \1 1 . 71 b elow . In fact, 
Lemma 111.71 enters, in similar fashion, in the proof of Lemma 112.211 

Proof of Lemma \12.21\ In the next few subsections we will show that for most points, 
(7(7(0)) G E fl i? p (7(0)), the sector g(6) consists almost entirely of points of E. The re- 
mainder of the construction will be a simple consequence of this fact. 

Implication of almost full measure of E in Bc r (7(1))- By the assumption (17.141) we 
have: 

(12.26) C 3 (E n 5,(7(0))) > rj ■ C 3 (B p tf(0))) > V p 4 . 

Let C > be the (large enough, but fixed) constant that was chosen in the intermediate 
case of the proof. Assume as in (17.151) that L(l) G int^o^)- chosen to be a large enough 
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universal Recall that the parameter u is defined in (I12.3p . At the end of the proof, it will 
become clear why it is necessary to choose this value of u; see (112.301) . (112.381) . (112. 59[) . 

For r C 0, set 

(12.27) s m = {t g [t\ I o < n\[r nsn 5,(7(0))) < u ■ v ■ P } , 

(12.28) 

T m = jf G [r| 0< H\(rnE n 5,(7(0)) and U\ (fnEn Si r (7(l))) < ■ 

Without loss of generality, we can assume that k is small enough that if f G then 
f(l) G Bi r (7(l)); see QTXTB1) . From (jTHSJ), JZ32), ffTTB . we get: 

(12.29) /^Pf?]) <^ 3 <a;77p 3 , 

provided that the following lower bound on w holds: 

5i ( r 



(12.30) w > 

V VP, 

An inspection of our choice of parameters (112. ip . (112.31) shows that (112. 30p does indeed hold 
in our setting. (Note that here, we used the requirement c > y/c > c.) 

Estimates in the space of pointed lines. Recall that v denotes the measure on P, the 
space of pointed lines, as in (lll.3p . Let 

U* = { (f, f (0)) f(0) G E n £ p (7(0)) and f C f (O^O)" 1 ( 



(12.31) S# = {(f, f(0)) J r G S n 0(9), f(0) = 0(7(0)) G 5 H 5,(7(0)) } 



Then 
(12.32) 

(12.33) 

(12.34) 



T* = |(f,f(0)) f(0) G E D 5 p (7(0)) and f G T [f] and f C f(0)7(0) -1 (©)} • 

^ ([/#) x r 2 £ 3 (5 n 5,(7(0))) ^^rV, 
v (S*) ^ 2 p 4 , 
v (T # ) < c^r 2 p 4 . 



Relation flEZg) follows from ([TT7T]) . 0TTT2"j) . OTOD . OTP) and 1111 . Relations (HZ 
012.341) follow similarly, using 012.271) and 012.291) . respectively. 



Recall that E is ^-convex on I?2c(7(0)), for #2 as in (17. 131) . Define: 
;i2.35) V={Le fines (5 2C (7(0))) | NCb 2o(?(0) )(5, L) > |} 

;i2.36) = {(L, x) G P I L G V and x G 5 n 5,(7(0)) } . 
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By (14.71) , Markov's inequality (13. 15ft and (111.31) , we have 
(12.37) v (V*) < — ■ p< ujr]r 2 p 

provided that in addition to (112.301) we also assume that 



'12.38) co> 



do 



pr 2 p 3 



An inspection of our choice of parameters (112. 2p . fl 1 2 . 3 [) shows that (I12.38[) holds in our 
setting (note that we used here the requirement c > a/c). 

It follows from fll2.32h - fll2.34p and fll2.37j) that 

(12.39) v(S*UT*UV*) <uo-v{U*) . 
Define 

(12.40) W* = U*\ (S*UT*U V*) . 

Then, for (f,f(0)) G W*, the definitions (fl2T311 . (fl2T36l) . together with LemmadLll imply 
that if we set 



J 

then 



(12.41) Hl(E'nt(J))<-. 

For x e E n B p (j(0)), x = g(i(Q)), put 

(12.42) <S(x) = {fc g(&) | (f, x)eW*} . 

Note that, using the above notation, the set of all f C p(O) with f(0) = x is parameterized 
by the corresponding angle <ft i n the sector given by (112.71) . It follows from (112. 39j) . (112. 32p . 
(112.261) . that there exists a large enough universal constant c > such that if we put 

(12.43) E 1 = {x E E n B p (7(0)) | #(0(a;)) < ccur 2 } . 
then 

(12-44) £ 3 (£i) > ^s(^ n S p (7(0))) > r/p 4 . 

In the following corollary and below, we consider the splitting of the measure 
(12.45) £ 3 = £x £ 2 = H 2 x £ 2 , 

relative to the product structure on EI = M 3 , where £ = 7i 2 corresponds to the center of EI 
(with its induced Tj-snowflake metric) and £ 2 to the plane (x, y, 0) C M 3 . In connection with 
(112.471) below, recall that £3(^(7(0))) x p 4 = p 2 x p 2 , where the product corresponds to 
the decomposition in (112. 45j) . 
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Corollary 12.46. There exists q G ir(Ei) such that 

(12.47) V p 2 < C(n-\q) H E,) < p 2 . 
// (7(7(0)) en- 1 (q)f]E 1 then 

(12.48) C 2 (E> n (<?(§) \ (B p (7(0)) U -B r (7(1))))) < ^r 2 . 

Proof. Relation (1 12.47ft follows from the splitting of the measure in (112.451) and (112.441) . 
Relation fl!2.48j) follows from the definitions of U # ,S#,T*,V#, W*, together with 012.4ip . 
( TT2T42D . (112431) . □ 

Construction of §; completion of the proof of Lemma 112.211 By virtue of (112.471) 

we can choose an interval / C [— p 2 ,p 2 } such that 

(12.49) £(/)x£p 2 , 

(12.50) C(n-\q)nE 1 nq(I))>^p 2 . 
Define a subset A of the center of HI by 

A = 7r- 1 ((o,o))ng- 1 (7r( J E; 1 ))n/. 

Choose g = q^(0)~\ Then g satisfies f fT2TT5|) . and by ([12T49]) . (TT2T50|) . a), b) of LemmaUMI] 
hold. Since by the definition of q , for all ( G A, we have #C7(0) G tt -1 ^) D Corollary 
112.461 implies that part c) holds as well. □ 



Completion of the proof of Proposition 17.101 By choosing c in Proposition 17.101 suf- 
ficiently small, we can assume without essential loss of generality that g of Lemma 112.211 is 
the point (0, 0, 0) and that / is symmetric about the origin. Fix s G [k, 1] and let r, xe with 
< < jog be as in Remark I12.12[ with 9 to be determined below. 

Let S denote the square with center r(s) and side length csr 2 , with one side parallel to 
Xe- Of the two sides of S which are orthogonal to xe, let a denote the one which is furthest 
from the origin (0,0) G M. 2 . Let 71 denote the rectangle with one side parallel to xe and of 
length, 2sr 2 + (s/2), and such the two side which are orthogonal to xe, the one furthest from 
(0, 0) G IR 2 is a. In particular, S C 71. By a standard covering argument, it will suffice to 
show that at most a fraction £ of points of S x [0, (csr) 2 } lie in E'; see (17.161) . Since by our 
assumptions p < ^nr 2 and s > k, 

(12.51) ip 2 < \iisr) 2 , 

it suffices to show that at most a fraction £ of the points of S X ([0, (csr) 2 } \ I) lie in E '. 
Let 

$0 = {L G [xe} I L n (a x ([0, (csr) 2 } \ /)) ^ 0} . 

Note that 5 can (essentially) be described alternatively as consisting of those lines in [xe} 
which intersect 7Z. If follows from (lll.ip that 

(12.52) m](^)>(sr 2 )(sr) 2 = s 3 r\ 
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Moreover, since E is ^-convex on B 2 c (7(0)), there exists 6 E ^0, -f^j such that 
(12.53) / NC B2o{m) (E,L)d» m (L) < ^ . 

We shall fix 9 as in (I12.53j) from now on. It follows from (112.521) . (I12.53[) that there exists a 
universal constant c" > such that if we define 

c"5 2 



(12.54) r 9 = { L e 

then 



NC_B 2O (^( ))(^, L) < 



£s 3 r 6 



Lemma 112.211 implies that 

£3 ((S \ (S p (7(0)) U S r (7(l)))) n £') < u;rV , 

and hence, using (111.11) and Markov's inequality (13.151) . we see that there exists a universal 
constant d" > such that if we define 

(12.56) 37 = ^Led t 

then 



d"t in 2 

K ((S \ (fl„(7(0)) u 5 r (7(i)))) n£'ni)< -^f 



;i2-57) A*DB5] (ST) > (l " f ) /*K5] (&) • 



Note that the definitions of #0 and S, together with (112.231) . (112. 241) . imply that for all Le^ 
we have 

(12.58) H\ ((§ \ (S p (7(0)) U B r (7(l)))) H B. (1 _ s) (7(0)) n l) > ^p 2 . 
Thus, by the definition of , and recalling that s > k, we see that provided that 

(12.59) u < c*r]^K 3 r 2 , 

where c* > is a small enough absolute constant, we have for all L G Jg*, 

(12.60) wi (e n B aM) (7(0)) ni)> ^ P 2 . 

We shall choose c = c* in (112.31) . thus completing our choice of u;. 
Assuming also that for a small enough constant, c** > 0, we have 

(12.61) 5 2 < c**^Vr 6 p 2 , 

which follows from our choice of 82 in (112.21) . provided c is small enough, we conclude from 
the definition of $g, together with (112.581) . that for all Leg™, 

(12.62) NC B2cm)) (E,L)<^p 2 , 
and 

(12.63) Hi (E n B a(1 _ f) (7(0)) ni)> NC B2cim) (E, L) . 
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Recall also, assumption (17.151) . which implies that C$(E' fl B r (^{l))) < 5ir 4 . Moreover, 
by the definition of 5e we have TL\ (L fl B r (j(l))) > r for all L G $g. 

For a sufficiently small universal constant c"" > (see below) define 

(12.64) $* 6 ** = {L G |Wi (L n B r (j(l)) n £) > c""r } . 
By Markov's inequality 

<5ir 4 

Therefore, if for a sufficently small universal constant, c*** > 0, we have, 

(12.65) 5 1 < c***K 3 £r, 
it follows that 



;i2.66) ^ > (i - 1) 



Note that our assumption (112. ip . together with p < |Kr 2 , implies that (112. 65p holds provided 
c is small enough. 

For every L G 5<T* we know that 

112.621 

Hl(LnB r (j(l))nE)>c'"r > NC B2c{m) (E, L), 

provided that 

(12.67) r]£p 2 < Ac"'r , 

which holds if c is small enough. In combination with (112.601) . (112.621) . (112. 63p . (112.661) and 
Lemma [11.71 we obtain the required result. □ 

13. Appendix 1: The Sparsest Cut problem 

As mentioned in Section [TJ Theorem II. 1| and in particular, Corollary ll.5| leads to an 
exponential improvement of the previously best known [13] lower bound on the integrality 
gap of the Goemans-Linial semidefinite relaxation of the Sparsest Cut problem with general 
demands. We now add some additional details to the discussion of Section [TJ for further 
details, we refer to see also our paper [24J, and the references therein. 

The Sparsest Cut problem with general demands (and capacities) asks for an efficient 
proceedure to partition a weighted graph into two parts, so as to minimize the interface 
between them. Formally, we are given an n- vertex graph G = (V, E), with a positive weight 
(called a capacity) c(e) associated to each edge e G E, and a nonnegative weight (called a 
demand) D(u,v) associated to each pair of vertices u, v G V. The goal is to evaluate in 
polynomial time (and in particular, while examining only a negligible fraction of the subsets 
of V) the quantity: 



;i3.1) $*(c,D) = min 



J2uveE c ( uv ) \Xs{u) - Xs{v)\ 



^ s ^ v T, u ,vev D ( u ' v ) \Xs(u) - Xs(v) | ' 
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To get a feeling for the meaning of $*, consider the case c(e) = D(u,v) = 1 for all e G E 
and u, v G V. This is an important instance of the Sparsest Cut problem which is called 
"Sparsest Cut with Uniform Demands" . In this case $* becomes: 

/, o ^ • #{ ed § es joining S and V \ S} 
( 13 - 2 ) $ = mm loi i77\ a\ ' 

m^scv \S\-\V\S\ 

Thus, in the case of uniform demands, the Sparsest Cut problem essentially amounts to 
solving efficiently the combinatorial isoperimetric problem on G: determining the subset of 
the graph whose ratio of edge boundary to its size is as small as possible. 

^From now on, the Sparsest Cut problem will be understood to be with general capacities 
and demands. These allow one to tune the notion of "interface" between S and V\S to a wide 
variety of combinatorial optimization problems, which is one of the reasons why the Sparsest 
Cut problem is one of the most important problems in the field of approximation algorithms. 
It is used as a subroutine in many approximation algorithms for NP-hard problems; see the 
survey article [60], as well as the references in [39l El [5j [23], for some of the (vast) literature 
on this topic. 

The problem of computing $*(c, D) in polynomial time is known to be NP hard |59j. The 
most fruitful approach to finding an approximate solution has been to consider relaxations 
of the problem. Observe that the term ds '■— \xs(u) — Xs(v)\, occuring in (j!3.ip is just the 
distance from u to v in the elementary cut metric associated to S; see Section [2j One is 
therefore led consider the minimization of the functional 

(13.3) 

where d varies over an enlarged class of metrics. The following successively larger classes of 
metrics have played a key role: elementary cut metrics (as in (113.11) ). L\ metrics (equivalently, 
cut metrics), metrics of negative type, arbitrary metrics. Denote the later two collections 
of metrics by NEG, MET, and denote the corresponding minima by $*(c, D) > &*l 1 {c, D) > 
^neg( c ' D) > $met( c ; D). From the standpoint of theoretical computer science, a key point 
is that $met( c > D) is a linear program (since the triangle inequality is a linear condition) 
and $^ EG (c, D) can be computed by the Ellipsoid Algorithm. Thus, both are computable 
in polynomial time with arbitrarily good precision. 

Let d denote an L\ metric and let denote the cut measure occuring in the cut metric 
representation d = J2scv ^d(S)ds of d; see ( 12 . Tjl . By substituting the cut metric representa- 
tion into (TnrSjl . it follows directly that in fact, $*(c,D) = $* Ll (c,D), for all c,D; [T0l[53l[9]. 
The key point is that L\ metrics are the convex cone generated by elementary cut metrics 
(and a corresponding statement would hold for any such convex cone and its generators). 

If di, o?2 are any two metrics on V, define their distortion by 

di(u,v)\ ( d 2 (u,v) 
max 



/ di(u,v)\ ( 
distfrfi, d 2 ) = max — • r 

\u,veV d2{U, V) J \u 



di(u,v) 



It follows immediately that for all c, D, 
13.4 max<— - — — — — , — — — — — > < dist di, do )■ 
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Thus, 



(13.5) SU P rh* 1 / n\ - SU P c i( V ' rf )' 

c,-D ^NEG^'-^J rfeNEG 

(13.6) sup— ^— — < sup Ci(V,d). 

c,D ^METK ^) deMET 

Recall that by definition, the left-hand sides in (113. 5p . fl 1 3 . 6 [) are the integrality gaps for 
the corresponding relaxations. Since $*(c, D) = Q* Li (c,D), Bourgain's embedding theorem 
implies that the first of these integrality gaps is < log n; E] • Similarly, for the Goemans- 
Linial semidefinite relaxation, the embedding theorem for metrics of negative type given in 
[3] implies the the integrality gap is < (logn) 1 / 2+ °^ 1 \ (Of course, the upper bound in terms 
of distortion also applies if L\ metrics are replaced by the original elementary cut metrics, 
but since these metrics are so highly degenerate, it does it not provide useful information. 
This illustrates the power of the observation of [53j [9].) 

From now on we restrict attention to the Goemans-Linial semidefinite relaxation. (What 
we say also applies mutadis mutandis to the relaxation to MET.) As in (113.51) . for all c, D, 
and all d 2 G NEG, 

$* (c,D) 

(13 - 7) to^^- 

A duality argument (sketched below) shows that for all d 2 , their exist c, D for which (113.71) 
becomes an equality. Therefore, the integrality gap is actually equal to sup dgNEG Ci(V, d). 

Thus, by Corollary 11.51 (see also Remark 11.71) the integrality gap is > (logn) 6 for some 
explicit 5 > 0. Recall that the previous best bound was > log logn; see [13], [¥T] . 



Here is a sketch of the duality argument; for further discussion, see Proposition 15.5.2 
of [33]. Consider a set V of cardinality n, the vertices of our graph (whose edge structure 
will be determined below). Define an embedding F of the metrics on V into R n(n ~ 1 )/ 2 as 
follows: The coordinates x U)V correspond to unordered pairs u, v of distinct vertices in V, 
and x U;V (F(d)) = d(u,v). The image of the L\ metrics on V is a convex cone C C R n ( n_1 )/ 2 
generated by the images of the elementary cut metrics ds- Let d 2 satisfy F(d 2 ) ^ C By an 
easy compactness argument, the distortion dist(<i, d 2 ), for d with F(d) e C, is minimized by 
some di, with F(di) G C. Minimality implies that in fact, F(di) G dC. Take a supporting 
hyperplane, P, for £ which passes through F(di). Let I denote a linear functional satisfying 
i\p = 0, £\c < and £(F(d 2 )) > 0. Let J2( u v) v) denote the coordinate representation 

of I. Define capacities by: c{uv ) = £ UjV if £ UjV > and otherwise c{uv ) = 0. Define demands 
by: D(u,v) = —£ U)V if £ U)V < and otherwise D(u,v) = 0. Define a graph structure with 
vertices, V, by stipulating that uv G E if and only if c{uv) > 0. It is trivial to check 
that for the above c, D, we have $(c, D,d) > 1, for all d with F(d) G C. Also ^(d^ = 1, 
dist(di,d 2 ) — ci(v,ck) ' These relations imply that for these c, -D, (| 13.7 |) is an 
equality. By choosing d 2 G NEG such that ci(V, d 2 ) is maximal, it follows that (113.51) is also 
an equality. (If F(d 2 ) G C, this is trivial.) 
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14. Appendix 2: Quantitative bounds, coercivity and monotonicity 

In this Appendix, we briefly discuss from a more general standpoint, the structure of our 
argument, as outlined after the statement of Theorem 1 1 . 1 1 and in SectionHl We point out that 
in essence, what we have done follows the general scheme of other arguments in geometric 
analysis and nonlinear partial differential equations; compare Example 114.11 b elow . Typically, 
the results are not stated explicitly in quantitative form. Here, we wish to emphasize that 
the possibility of an estimate of the form of (11.2j) is actually implicit in the arguments. The 
crucial ingredient is a quantity which is coercive, monotone and bounded. 

Coercivity and almost rigidity. The term, rigid, connotes special (i.e. highly con- 
strained) structure. A standard feature of (the statment and proof of) rigidity theorems 
is the existence of a numerical measurement Q > which is coercive in the sense that if 
Q = 0, then the desired rigidity holds, and more generally (and often much harder to prove) 
if Q < e a , for some a < oo, then in a suitable sense, the structure is e-close the one which 
is obtained in the rigid case. Statments of this type are known as a stability theorems, 
e-regularity theorems, almost rigidity theorems. A classical example from Riemannian ge- 
ometry is the sphere theorem, in which the coercive quantity is minus the logarithm of the 
pinching; see [13] and the references therein. 

In our case, we are given £cl, and the coercive quantity is Q(E) = NM Br ( x \(E), the 
nonmonotonicity of E on B r (x). Coercivitity is the statement that monotone subsets are 
half-spaces, or more generally, that almost monontone subsets are close to half-spaces; see 
Theorem 14.131 

Bounded monotone quantities and existence of a good scale. As in Sections [TJ, [21 
we point out the general character of the estimate in Proposition 14.281 for the scale on which 
Theorem 14.131 can be applied. Namely, by Markov's inequality ( 13. 15[) (which in this case 
amounts to the pigeonhole principle), such an estimate for the scale will appear whenever we 
are dealing with an a priori bounded nonnegative quantity, which can be written as a sum of 
nonnegative terms, which correspond to the various scales, such that each term is coercive 
on its own scale (in the suitably scaled sense). Such a quantity is monotone in the sense that 
the sum is nondecreasing as we include more and more scales; compare f!4.3ip . (16.111) . 

Quantities which are coercive and monotone are well-known to play a key role in geometric 
analysis and in partial differential equations. In the latter case, for evolution equations, 
monotonicity is defined with respect to the time parameter, rather than the scale. 

We include below the following illustrative example, which requires familiarity with Rie- 
mannian geometry. Numerous other choices from diverse areas would serve equally well; 
compare Remark 12. 1[ 

Example 14.1. A theorem from Riemannian geometry states that for noncollapsed Gromov- 

Hausdorff limit spaces, M™ Y n , such that RicM™ > —{n — 1) for all i, every tangent 
cone Y y is a metric cone; see Remark 4.99 of [T7| and [TB]. In this case, the rigid objects are 
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metric cones and the relevant coercive quantity, Q, is derived from the volume ratio, 



(14 2) 

1 ■ ' Vol(B r (p)) ' 

where p e M n and M n denotes the hyperbolic n-space with curvature = — 1. The coercivity 
of Q is guaranteed by the "volume cone implies metric cone theorem" and its corresponding 
almost rigidity theorem, which states that almost volume cones are close in the Gromov- 
Hausdorff sense to be almost metric cones; see [16], [17]. The monotonicity of Q is a conse- 
quence of the Bishop-Gromov inequality, which asserts that the volume ratio in (I14.2p is a 
monotone nonincreasing function of r. 
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